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SEQUENCE LISTING 



(1) GENERAlX INFORMATION: 



T£CH CENTER ix„ 



(i) APPDJCANT: 



BIRKELUND, Svend 
CHRISTIANSEN, Gunna 
HEBSGAARD PEDERSEN, Anna-Sofiej 
MYGIND, Per 
KNUDSEN, Katrine 



(ii) TITLE OF INVENTION: SURFACE EXPOSED PRO/^INS FROM CHLAMYDIA 
PNEUMONIAE 




(iii) NUMBER OF SEQUENCES: 30 

(iv) CORRESPONDENCES ADDRESS: 

(A) ADDRESSED: BROWDY AND NEIMARKJ/P . L . L . C . 

(B) STREET: 62L4 Ninth Street, N.W^ Suite 300 

(C) CITY: Washington 

(D) STATE: D.C.N 

(E) COUNTRY: US/ 

(F) ZIP: 20001 

(v) COMPUTER READABLE F^DRM: 

(A) MEDIUM TYPE: Floppy dis 

(B) COMPUTER: IBM POL compa/tir/le 

(C) OPERATING SYSTEm\ PC^OS/MS-DOS 

(D) SOFTWARE: PatentlV Bfeleafse #1.0, Version #1.30 

(vi) CURRENT APPLICATION DAI 

(A) APPLICATION NUMBER : \0 9 / 4 4 6 , 67 7 

(B) FILING DATE: 24->MAR-\oOO 

(vii) PRIOR APPLICATION D&TA: 

(A) APPLICATION NUMBER: PC4/DK98/00266 

(B) FILING DATE:/l9-JUN-199N 

vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DK 07\4/97 

(B) FILING D^TE: 23-JUN-1997 

(viii) ATTORNEY/AGE^T INFORMATION: 

(A) NAME: /oOPER, Iver P. 

(B) REGISTRATION NUMBER: 28,005 

(C) REFERENCE/ DOCKET NUMBER: BIRKE^UND=1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-628-5197 

(B) TELEFAX: 202-737-3528 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS:, 

(A) LENGTH: 3200 base pairs 

(B) TYPE: nucleic acid 
r (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/ KEY : Coding Sequence 

(B) LOCATION: 205... 2987 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CAATGTCGAA GAGAGCACTA AC C AG GAAAA TTGCGATTTC ATAAACCCAC T T TAT TAT T A 60 
AATTCTTACT TGCGTCATAT AAAATAGAAA ACTCAGAGAG T CAAGAT AAA AATTCTTGAC 12 0 
AGCTGTTTTG TCATCTTTAA CTTGATTTAC TTATTTTGTT T C TAT AT T G A TGCGAATAGT 18 0 
TCTCTAAAAA ACAAAAGCAT TACC ATG AAG ACT TCG ATT CCT TGG GTT TTA 231 

Met Lys Thr Ser lie Pro Trp Val Leu 
1 5 

GTT TCC TCC GTG TTA GCT TTC TCA TGT CAC CTA CAG TCA CTA GCT AAC 27 9 

Val Ser Ser Val Leu Ala Phe Ser Cys His Leu Gin Ser Leu Ala Asn 
10 15 20 25 

GAG GAA CTT TTA TCA CCT GAT GAT AGC TTT AAT GGA AAT ATC GAT TCA 327 
Glu Glu Leu Leu Ser Pro Asp Asp Ser Phe Asn Gly Asn lie Asp Ser 
30 35 40 

GGA ACG TTT ACT CCA AAA ACT TCA GCC ACA ACA TAT TCT CTA ACA GGA 37 5 

Gly Thr Phe Thr Pro Lys Thr Ser Ala Thr Thr Tyr Ser Leu Thr Gly 
45 50 55 

GAT GTC TTC TTT TAC GAG CCT GGA AAA GGC ACT CCC TTA TCT GAC AGT 42 3 

Asp Val Phe Phe Tyr Glu Pro Gly Lys Gly Thr Pro Leu Ser Asp Ser 

60 65 70. 

TGT TTT AAG CAA ACC ACG GAC AAT CTT ACC TTC TTG GGG AAC GGT CAT 471 
Cys Phe Lys Gin Thr Thr Asp Asn Leu Thr Phe Leu Gly Asn Gly His 
75 80 85 

AGC TTA ACG TTT GGC TTT ATA GAT GCT GGC ACT CAT GCA GGT GCT GCT 519 
Ser Leu Thr Phe Gly Phe lie Asp Ala Gly Thr His Ala Gly Ala Ala 
90 95 100 105 

GCA TCT ACA ACA GCA AAT AAG AAT CTT ACC TTC TCA GGG TTT TCC TTA 5 67 

Ala Ser Thr Thr Ala Asn Lys Asn Leu Thr Phe Ser Gly Phe Ser Leu 
110 115 120 

CTG AGT TTT GAT TCC TCT CCT AGC ACA ACG GTT ACT ACA GGT CAG GGA 615 
Leu Ser Phe Asp Ser Ser Pro Ser Thr Thr Val Thr Thr Gly Gin Gly 
125 130 135 

ACG CTT TCC TCA GCA GGA GGC GTA AAT TTA GAA AAT ATT CGT AAA CTT 663 
Thr Leu Ser Ser Ala Gly Gly Val Asn Leu Glu Asn lie Arg Lys Leu 
140 145 150 

GTA GTT GCT GGG AAT TTT TCT ACT GCA GAT GGT GGA GCT ATC AAA GGA 711 
Val Val Ala Gly Asn Phe Ser Thr Ala Asp Gly Gly Ala lie Lys Gly 
155 160 165 

GCG TCT TTC CTT TTA ACT GGC ACT TCT GGA GAT GCT CTT TTT AGT AAC 75 9 

Ala Ser Phe Leu Leu Thr Gly Thr Ser Gly Asp Ala Leu Phe Ser Asn 
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170 

AAC TCT TCA 
Asn Ser Ser 



CGC ATA GCA 
Arg lie Ala 



TCT ACG TCA 
Ser Thr Ser 
220 

AAC AAC AAA 
Asn Asn Lys 
235 

GGT GCG ATC 
Gly Ala lie 
250 

TCT AAC AAT 
Ser Asn Asn 



GGT GGC GCC 
Gly Gly Ala 



ACA GAG TTT 
Thr Glu Phe 
300 

GCT ATC AGC 
Ala lie Ser 
315 

GGA AAC ATT 
Gly Asn lie 
330 

GAT ACT CCT 
Asp Thr Pro 



ACG GAA TTA 
Thr Glu Leu 



ATC ACT TCA 
lie Thr Ser 
380 

TCT GCG GGA 
Ser Ala Gly 
395 



175 

TCA ACA AAG 
Ser Thr Lys 
190 

AAT AAC ACA 
Asn Asn Thr 
205 

GGA GGC GCT 
Gly Gly Ala 



TTT CTA TAT 
Phe Leu Tyr 



TGC AAC ACC 
Cys Asn Thr 
255 

AAG ACT CTG 
Lys Thr Leu 
270 

ATC CAT GCT 
lie His Ala 
285 

CTA CGA AAT 
Leu Arg Asn 



ATC GAT GCC 
lie Asp Ala 



ACC TTT GTA 
Thr Phe Val 
335 

AAA CGT AAT 
Lys Arg Asn 
350 

CGG GCT GCT 
Arg Ala Ala 
365 

GAA GGA ACC 
Glu Gly Thr 



GCT CTC AAT 
Ala Leu Asn 



GGA GGA GCA 
Gly Gly Ala 



GGT TAT GTT 
Gly Tyr Val 
210 

ATC GAT GAT 
lie Asp Asp 
225 

TTT GAA GGG 
Phe Glu Gly 
240 

AAG GCG AGT 
Lys Ala Ser 



ATC TTT GCT 
lie Phe Ala 



AAA AAG CTA 
Lys Lys Leu 
290 

AAT GTC TCA 
Asn Val Ser 
305 

TCA GGA GAG 
Ser Gly Glu 
320 

AGA AAT ACC 
Arg Asn Thr 



GCG ATC AAC 
Ala lie Asn 



AAA AAT CAT 
Lys Asn His 
370 

TCA TCA GAC 
Ser Ser Asp 
385 

CCA TAT CAA 
Pro Tyr Gin 
400 



180 

ATT GCT ACT 
lie Ala Thr 
195 

AGA TTC CTA 
Arg Phe Leu 



GAA GGC ACG 
Glu Gly Thr 



AAT GCA GCG 
Asn Ala Ala 
245 

GGA TCT CCT 
Gly Ser Pro 
260 

TCA AAC GTA 
Ser Asn Val 
275 

GCC CTT TCC 
Ala Leu Ser 



TCA GCA ACT 
Ser Ala Thr 



CTC AGT CTT 
Leu Ser Leu 
325 

CTT ACA ACA 
Leu Thr Thr 
340 

ATA GGA AGT 
lie Gly Ser 
355 

ACA ATT TTC 
Thr lie Phe 



GTA TTG AAG 
Val Leu Lys 



GGA ACG ATT 
Gly Thr lie 
405 



ACA GCA GGC 
Thr Ala Gly 
200 

TCT AAC ATA 
Ser Asn lie 
215 

TCG ATA CTA 
Ser lie Leu 
230 

AAA ACT ACT 
Lys Thr Thr 



GAA CTG ATA 
Glu Leu lie 



GCA GAA ACA 
Ala Glu Thr 
280 

TCT GGA GGC 
Ser Gly Gly 
295 

CCT AAG GGG 
Pro Lys Gly 
310 

TCT GCA GAG 
Ser Ala Glu 



ACC GGA AGT 
Thr Gly Ser 



AAC GGG AAA 
Asn Gly Lys 
360 

TTC TAT GAT 
Phe Tyr Asp 
375 

ATA AAT AAC 
lie Asn Asn 
390 

CTA TTT TCT 
Leu Phe Ser 



185 

GCT 807 
Ala 



GCG 855 
Ala 



TCG 903 
Ser 



GGC 951 
Gly 



ATC 999 

He 

265 

AGC 1047 
Ser 



TTT 1095 
Phe 



GGT 1143 
Gly 



ACA 1191 
Thr 



ACC 1239 

Thr 

345 

TTC 1287 
Phe 



CCC 1335 
Pro 



GGC 1383 
Gly 



GGA 1431 
Gly 
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GAA ACC CTA 
Glu Thr Leu 
410 

TCA TTC ACG 
Ser Phe Thr 



AAG GGA GTC 
Lys Gly Val 

CTC CTC GGC 
Leu Leu Gly 
460 

ATT ACA ATC 
He Thr He 
475 

CAG CCC GTC 
Gin Pro Val 
490 

TCT GGG AAG 
Ser Gly Lys 



CAT ATG TTC 
His Met Phe 



GAT GCT GAT 
Asp Ala Asp 
540 

CCT GCT GAG 
Pro Ala Glu 
555 

GTT AAT TGG 
Val Asn Trp 
570 

ACT TGG ACC 
Thr Trp Thr 



TTA GTA TGC 
Leu Val Cys 

CAA CAG CTT 
Gin Gin Leu 
620 

TTC TGG GTT 
Phe Trp Val 



ACA GCA GAT 
Thr Ala Asp 
415 

CAG CCA GTC 
Gin Pro Val 
430 

ACT TTA GAG 
Thr Leu Glu 
445 

ATG GAT TCA 
Met Asp Ser 



ACG AAC CTA 
Thr Asn Leu 



AGC CTA ACA 
Ser Leu Thr 
495 

CTC AAC CTG 
Leu Asn Leu 
510 

AGC CAT GAC 
Ser His Asp 
525 

GTT GAT ACT 
Val Asp Thr 



GAT CCT AAT 
Asp Pro Asn 



ACT ACG GAT 
Thr Thr Asp 
575 

AAA ACA GGA 
Lys Thr Gly 
590 

AAT ACC CTA 
Asn Thr Leu 
605 

GTA GAG ATC 
Val Glu He 



TCC TCC ATG 
Ser Ser Met 



GAA CTT AAA 
Glu Leu Lys 



TCC CTA TCC 
Ser Leu Ser 



AGC ACG AGC 
Ser Thr Ser 
450 

GGA ACG ACA 
Gly Thr Thr 
465 

GGA ATC AAT 
Gly He Asn 
480 

GCA AAA GGT 
Ala Lys Gly 



ATT GAT ATT 

He Asp He 



CAG CTC TTC 
Gin Leu Phe 
530 

AAC GTT GAC 
Asn Val Asp 
545 

TCA GAA TAC 
Ser Glu Tyr 
560 

ACA GCT ACA 
Thr Ala Thr 



TTT GTT CCC 
Phe Val Pro 



TGG GGA GTC 
Trp Gly Val 
610 

GGC GCA ACT 
Gly Ala Thr 
625 

ACG AAC TTC 
Thr Asn Phe 



GTT GCT GAC 
Val Ala Asp 
420 

GGA GGA AAG 
Gly Gly Lys 
435 

TTC TCT CAA 
Phe Ser Gin 



TTA TCA ACT 
Leu Ser Thr 



GTT GAC TCC 
Val Asp Ser 
485 

GCT TCA AAT 
Ala Ser Asn 
500 

GAA GGG AAC 

Glu Gly Asn 
515 

TCT CTA TTA 

Ser Leu Leu 



ATC AGC AGC 
He Ser Ser 



GGA TTC CAA 
Gly Phe Gin 
565 

AAT ACA AAA 
Asn Thr Lys 
580 

AGC CCC GAA 
Ser Pro Glu 
595 

TTT ACT GAC 
Phe Thr Asp 



GGT ATG GAA 
Gly Met Glu 



CTG CAT AAG 
Leu His Lys 



AAT TTA AAA 
Asn Leu Lys 

TTA TTG CTA 
Leu Leu Leu 
440 

GAG GCC GGT 
Glu Ala Gly 
455 

ACA GCT GGG 
Thr Ala Gly 
470 

TTA GGT CTT 
Leu Gly Leu 



AAA GTG ATC 
Lys Val He 



ATT TAT GAA 
He Tyr Glu 
520 

AAA ATC ACG 
Lys He Thr 
535 

CTT ATC CCT 
Leu He Pro 
550 

GGA CAA TGG 
Gly Gin Trp 



GAG GCC ACG 
Glu Ala Thr 



AGA AAA TCT 
Arg Lys Ser 
600 

ATT CGC TCT 
He Arg Ser 
615 

CAC AAA CAA 
His Lys Gin 
630 

ACT GGA GAT 
Thr Gly Asp 



TCT 1479 

Ser 

425 

CAA 1527 
Gin 



TCT 1575 
Ser 



AGT 1623 
Ser 



AAG 1671 
Lys 



GTA 1719 

Val 

505 

AGT 17 67 

Ser 



GTT 1815 
Val 



GTT 1863 
Val 



AAT 1911 
Asn 



GCA 1959 

Ala 

585 

GCG 2007 
Ala 



CTG 2055 
Leu 



GGT 2103 
Gly 



GAA 2151 
Glu 



635 

AAT CGC AAA 
Asn Arg Lys 
650 

AGT GCT CAC 
Ser Ala His 



CTC TTT GCT 
Leu Phe Ala 



ACC TAC GGT 
Thr Tyr Gly 
700 

CAA AAC TAT 
Gin Asn Tyr 
715 

GAA AAA TTC 
Glu Lys Phe 
730 

TTC AGC CAT 
Phe Ser His 



GAA TCC GAA 
Glu Ser Glu 



CTA GAC CTT 
Leu Asp Leu 
780 

TTC ATT CCA 
Phe lie Pro 
795 

TTC TTC GAA 
Phe Phe Glu 
810 

CTT AAC CTC 
Leu Asn Leu 



GGA GAT TCC 
Gly Asp Ser 



TAT CGT AAC 
Tyr Arg Asn 
860 




GGC TTC CGT 
Gly Phe Arg 
655 

ACT CCT AAA 
Thr Pro Lys 
670 

AGA GAC AAA 
Arg Asp Lys 
685 

GGA ACT TTA 
Gly Thr Leu 



TTG AGA TTA 
Leu Arg Leu 



CCT AGG GAA 
Pro Arg Glu 

735 

TCA GAC AAC 
Ser Asp Asn 
750 

GGT TCT TGG 
Gly Ser Trp 
765 

CCT TTT GTT 
Pro Phe Val 



CAG ATG AAA 
Gin Met Lys 



AGC TCT AGT 
Ser Ser Ser 
815 

TCG ATT CCT 
Ser lie Pro 
830 

TAC ACC TAT 
Tyr Thr Tyr 
845 

AAT CCC CAA 
Asn Pro Gin 



640 

CAT ACC TCT 
His Thr Ser 



GAC GAC CTA 
Asp Asp Leu 



GAT TGT TTT 
Asp Cys Phe 
690 

TTC TTC AAG 
Phe Phe Lys 
705 

GGA AGA GCA 
Gly Arg Ala 
720 

ATT CCC CTA 
lie Pro Leu 



CGT ATG GAA 
Arg Met Glu 



AGC AAC GAG 
Ser Asn Glu 
770 

CTT TCC AAC 
Leu Ser Asn 
785 

GTC GAA ATG 
Val Glu Met 
800 

GAT GGC CGT 
Asp Gly Arg 



GTG GGT GCG 
Val Gly Ala 



GAT CTC TCA 
Asp Leu Ser 
850 

TCT ACA GCG 
Ser Thr Ala 
865 
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645 

GGA GGC TAC 
Gly Gly Tyr 
660 

TTT ACC TTT 
Phe Thr Phe 
675 

ATC GCT CAC 
lie Ala His 



CAC TCT CAT 
His Ser His 



AAG TTT TCT 
Lys Phe Ser 
725 

GCC TTG GAT 
Ala Leu Asp 
740 

ACG CAC TAT 
Thr His Tyr 
755 

TGT ATA GCT 

Cys lie Ala 



CCA CAT CCT 
Pro His Pro 



GTT TAT GTA 

Val Tyr Val 
805 

GGT TTT AGT 

Gly Phe Ser 
820 

AAA TTC GTG 

Lys Phe Val 
835 

GGA TTC TTT 

Gly Phe Phe 



ACT CTT GTG 
Thr Leu Val 



GTC ATC GGT 
Val lie Gly 



GCG TTC TGC 
Ala Phe Cys 
680 

AAC AAC TCT 
Asn Asn Ser 
695 

ACC CTA CAA 
Thr Leu Gin 
710 

GAA TCA GCT 
Glu Ser Ala 



GTC CAA GTT 
Val Gin Val 



ACC TCA TTG 
Thr Ser Leu 
760 

GGT GGT ATC 
Gly Gly He 
775 

CTT TTC AAG 
Leu Phe Lys 
790 

TCA CAA AAT 
Ser Gin Asn 



ATT GGA AGG 
He Gly Arg 



CAG GGG GAT 
Gin Gly Asp 
840 

GTT TCC GAT 
Val Ser Asp 
855 

ATG AGC CCA 
Met Ser Pro 
870 



GGA 219 9 

Gly 

665 

CAT 2247 
His 



AGA 2295 
Arg 



CCC 2343 
Pro 



ATA 2391 
He 



TCG 2439 

Ser 

745 

CCA 2487 
Pro 



GGC 2535 
Gly 



ACC 25 83 

Thr 



AGC 2 631 

Ser 



CTG 2679 

Leu 

825 

ATC 2727 
He 



GTC 2775 
Val 



GAC 2823 
Asp 
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TCT TGG AAA ATT CGC GGT GGC AAT CTT TCA AGA CAG GCA TTT TTA CTG 2 871 

Ser Trp Lys lie Arg Gly Gly Asn Leu Ser Arg Gin Ala Phe Leu Leu 
875 880 885 

AGG GGT AGC AAC AAC TAC GTC TAC AAC TCC AAT TGT GAG CTC TTC GGA 2 919 

Arg Gly Ser Asn Asn Tyr Val Tyr Asn Ser Asn Cys Glu Leu Phe Gly 
890 895 900 905 

CAT TAC GCT ATG GAA CTC CGT GGA TCT TCA AGG AAC TAC AAT GTA GAT 2 9 67 

His Tyr Ala Met Glu Leu Arg Gly Ser Ser Arg Asn Tyr Asn Val Asp 
910 915 920 

GTT GGT ACC AAA CTC CGA TT CTAGATTGCT AAAACTCCCT AGTTCTTCTA GGGAG 3 022 
Val Gly Thr Lys Leu Arg Phe 
925 

TTTTCTCATA CTTTTAGGGA AATATTTGCT ATAGGGAATG CTTTCCTTGC AAAC T GT AAA 3 082 

AAATAAC AT T TGTCCCTCTT CAAAAAAGAT TTCTTTTAAT AATTTCTAGT TATAATTTTA 3142 

TTTTAAAAAC AGT TAAATAA T T AAT AG AC A ATAATCTATT CTTATTGACT TCTTTTTT 32 00 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 928 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Lys Thr Ser lie Pro Trp Val Leu Val Ser Ser Val Leu Ala Phe 
15 10 15 

Ser Cys His Leu Gin Ser Leu Ala Asn Glu Glu Leu Leu Ser Pro Asp 
20 25 30 

Asp Ser Phe Asn Gly Asn lie Asp Ser Gly Thr Phe Thr Pro Lys Thr 
35 40 45 

Ser Ala Thr Thr Tyr Ser Leu Thr Gly Asp Val Phe Phe Tyr Glu Pro 
50 55 60 

Gly Lys Gly Thr Pro Leu Ser Asp Ser Cys Phe Lys Gin Thr Thr Asp 
65 70 75 80 

Asn Leu Thr Phe Leu Gly Asn Gly His Ser Leu Thr Phe Gly Phe lie 
85 90 95 

Asp Ala Gly Thr His Ala Gly Ala Ala Ala Ser Thr Thr Ala Asn Lys 

100 105 110 

Asn Leu Thr Phe Ser Gly Phe Ser Leu Leu Ser Phe Asp Ser Ser Pro 
115 120 125 



Ser Thr Thr Val Thr Thr Gly Gin Gly Thr Leu Ser Ser Ala Gly Gly 
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130 135 140 

Val Asn Leu Glu Asn lie Arg Lys Leu Val Val Ala Gly Asn Phe Ser 
145 150 155 160 

Thr Ala Asp Gly Gly Ala lie Lys Gly Ala Ser Phe Leu Leu Thr Gly 
165 170 175 

Thr Ser Gly Asp Ala Leu Phe Ser Asn Asn Ser Ser Ser Thr Lys Gly 
180 185 190 

Gly Ala lie Ala Thr Thr Ala Gly Ala Arg lie Ala Asn Asn Thr Gly 
195 200 205 

Tyr Val Arg Phe Leu Ser Asn lie Ala Ser Thr Ser Gly Gly Ala lie 
210 215 220 

Asp Asp Glu Gly Thr Ser lie Leu Ser Asn Asn Lys Phe Leu Tyr Phe 
225 230 235 240 

Glu Gly Asn Ala Ala Lys Thr Thr Gly Gly Ala lie Cys Asn Thr Lys 
245 250 255 

Ala Ser Gly Ser Pro Glu Leu lie lie Ser Asn Asn Lys Thr Leu lie 
260 265 270 

Phe Ala Ser Asn Val Ala Glu Thr Ser Gly Gly Ala lie His Ala Lys 
275 280 285 

Lys Leu Ala Leu Ser Ser Gly Gly Phe Thr Glu Phe Leu Arg Asn Asn 
290 295 300 

Val Ser Ser Ala Thr Pro Lys Gly Gly Ala lie Ser lie Asp Ala Ser 
305 310 315 320 

Gly Glu Leu Ser Leu Ser Ala Glu Thr Gly Asn lie Thr Phe Val Arg 
325 330 335 

Asn Thr Leu Thr Thr Thr Gly Ser Thr Asp Thr Pro Lys Arg Asn Ala 
340 345 350 

lie Asn lie Gly Ser Asn Gly Lys Phe Thr Glu Leu Arg Ala Ala Lys 
355 360 365 

Asn His Thr lie Phe Phe Tyr Asp Pro lie Thr Ser Glu Gly Thr Ser 
370 375 380 

Ser Asp Val Leu Lys lie Asn Asn Gly Ser Ala Gly Ala Leu Asn Pro 
385 390 395 400 

Tyr Gin Gly Thr lie Leu Phe Ser Gly Glu Thr Leu Thr Ala Asp Glu 
405 410 415 

Leu Lys Val Ala Asp Asn Leu Lys Ser Ser Phe Thr Gin Pro Val Ser 
420 425 430 



Leu Ser Gly Gly Lys Leu Leu Leu Gin Lys Gly Val Thr Leu Glu Ser 
435 440 445 
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Thr Ser Phe Ser Gin Glu Ala Gly Ser Leu Leu Gly Met Asp Ser Gly 
450 455 460 

Thr Thr Leu Ser Thr Thr Ala Gly Ser lie Thr lie Thr Asn Leu Gly 
465 470 475 480 

lie Asn Val Asp Ser Leu Gly Leu Lys Gin Pro Val Ser Leu Thr Ala 
485 490 495 

Lys Gly Ala Ser Asn Lys Val lie Val Ser Gly Lys Leu Asn Leu lie 
500 505 510 

Asp lie Glu Gly Asn lie Tyr Glu Ser His Met Phe Ser His Asp Gin 
515 520 525 

Leu Phe Ser Leu Leu Lys lie Thr Val Asp Ala Asp Val Asp Thr Asn 
530 535 540 

Val Asp lie Ser Ser Leu lie Pro Val Pro Ala Glu Asp Pro Asn Ser 
545 550 555 560 

Glu Tyr Gly Phe Gin Gly Gin Trp Asn Val Asn Trp Thr Thr Asp Thr 
565 570 575 

Ala Thr Asn Thr Lys Glu Ala Thr Ala Thr Trp Thr Lys Thr Gly Phe 
580 585 590 

Val Pro Ser Pro Glu Arg Lys Ser Ala Leu Val Cys Asn Thr Leu Trp 
595 600 605 

Gly Val Phe Thr Asp lie Arg Ser Leu Gin Gin Leu Val Glu lie Gly 
610 615 620 

Ala Thr Gly Met Glu His Lys Gin Gly Phe Trp Val Ser Ser Met Thr 
625 630 635 640 

Asn Phe Leu His Lys Thr Gly Asp Glu Asn Arg Lys Gly Phe Arg His 
645 650 655 

Thr Ser Gly Gly Tyr Val lie Gly Gly Ser Ala His Thr Pro Lys Asp 
660 665 670 

Asp Leu Phe Thr Phe Ala Phe Cys His Leu Phe Ala Arg Asp Lys Asp 
675 680 685 

Cys Phe lie Ala His Asn Asn Ser Arg Thr Tyr Gly Gly Thr Leu Phe 
690 695 700 

Phe Lys His Ser His Thr Leu Gin Pro Gin Asn Tyr Leu Arg Leu Gly 
705 710 715 720 

Arg Ala Lys Phe Ser Glu Ser Ala lie Glu Lys Phe Pro Arg Glu lie 
725 730 735 

Pro Leu Ala Leu Asp Val Gin Val Ser Phe Ser His Ser Asp Asn Arg 
740 745 750 
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Met Glu Thr His 
755 

Asn Glu Cys lie 
770 

Ser Asn Pro His 
785 

Glu Met Val Tyr 

Gly Arg Gly Phe 
820 

Gly Ala Lys Phe 
835 

Leu Ser Gly Phe 
850 

Thr Ala Thr Leu 

865 

Asn Leu Ser Arg 



Tyr Asn Ser Asn 
900 

Gly Ser Ser Arg 
915 



Tyr Thr Ser Leu 
760 

Ala Gly Gly lie 
775 

Pro Leu Phe Lys 
790 

Val Ser Gin Asn 
805 

Ser lie Gly Arg 



Val Gin Gly Asp 
840 

Phe Val Ser Asp 
855 

Val Met Ser Pro 
870 

Gin Ala Phe Leu 
885 

Cys Glu Leu Phe 



Asn Tyr Asn Val 
920 



Pro Glu Ser Glu 



Gly Leu Asp Leu 
780 

Thr Phe lie Pro 

795 

Ser Phe Phe Glu 
810 

Leu Leu Asn Leu 
825 

lie Gly Asp Ser 



Val Tyr Arg Asn 
860 

Asp Ser Trp Lys 
875 

Leu Arg Gly Ser 
890 

Gly His Tyr Ala 
905 

Asp Val Gly Thr 



Gly Ser Trp Ser 
765 

Pro Phe Val Leu 



Gin Met Lys Val 
800 

Ser Ser Ser Asp 
815 

Ser lie Pro Val 
830 

Tyr Thr Tyr Asp 
845 

Asn Pro Gin Ser 



lie Arg Gly Gly 

880 

Asn Asn Tyr Val 
895 

Met Glu Leu Arg 
910 

Lys Leu Arg Phe 
925 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2815 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

ATGAAATCGC AATTTTCCTG GTTAGTGCTC TCTTCGACAT TGGCATGTTT TACTAGTTGT 60 

TCCACTGTTT TTGCTGCAAC TGCTGAAAAT ATAGGCCCCT C T GAT AG CT T TGACGGAAGT 12 0 

ACTAACACAG GCACCTATAC TCCTAAAAAT ACGACTACTG GAATAGACTA TACTCTGACA 180 

G GAG AT AT AA CTCTGCAAAA CCTTGGGGAT TCGGCAGCTT T AAC GAAG G G TTGTTTTTCT 24 0 

GACACTACGG AAT CTTTAAG CTTTGCCGGT AAGGGGTACT CACTTTCTTT TTTAAATATT 3 00 

AAGTCTAGTG CTGAAGGCGC AGCACTTTCT GTTACAACTG AT AAAAAT C T GTCGCTAACA 3 60 

GGATTTTCGA GTCTTACTTT CTTAGCGGCC CCATCATCGG T AAT CAC AAC CCCCTCAGGA 42 0 

AAAGGT GCAG TTAAATGTGG AGGGGATCTT ACATTTGATA AC AAT G G AAC TATTTTATTT 4 80 

AAACAAGATT ACTGTGAGGA AAATGGCGGA GCCATTTCTA CCAAGAATCT TTCTTTGAAA 54 0 

AAC AG CAC G G GATCGATTTC TTTTGAAGGG AATAAATCGA G C G C AAC AG G GAAAAAAGGT 60 0 

GGGGCTATTT GTGCTACTGG TACTGTAGAT AT T ACAAAT A ATACGGCTCC TACCCTCTTC 660 

TCGAACAATA TTGCTGAAGC TGCAGGTGGA GCTATAAATA G CAC AG G AAA CT GTACAAT T 72 0 

ACAGGGAATA CGTCTCTTGT ATTTTCTGAA AATAGTGTGA CAGCGACCGC AGGAAATGGA 7 80 
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GGAGCTCTTT CTGGAGATGC CGATGTTACC ATATCTGGGA AT CAGAGT GT AACTTTCTCA 84 0 

GGAAACCAAG CTGTAGCTAA TGGCGGAGCC ATTTATGCTA AGAAGCTTAC ACTGGCTTCC 9 00 

GGGGGGGGGG GGGGTATCTC CTTTTCTAAC AATATAGTCC AAGGT AC CAC TGCAGGTAAT 9 60 

GGTGGAGCCA TTTCTATACT GGCAGCTGGA GAGTGTAGTC TTTCAGCAGA AGCAGGGGAC 102 0 

ATTACCTTCA ATGGGAATGC CATTGTTGCA ACTACACCAC AAACTACAAA AAGAAATTCT 10 8 0 

ATTGACATAG GAT C TACT GC AAAGATCACG AATTTACGTG CAATATCTGG G CAT AG CAT C 114 0 

TTTTTCTACG AT C C GAT T AC TGCTAATACG GCTGCGGATT C T AC AG AT AC TTTAAATCTC 12 0 0 

AATAAGGCTG ATGCAGGTAA TAGTACAGAT TATAGTGGGT CGATTGTTTT TTCTGGTGAA 12 60 

AAGCTCTCTG AAGATGAAGC AAAAGTTGCA GACAACCTCA CTTCTACGCT GAAGCAGCCT 132 0 

GTAACTCTAA CTGCAGGAAA TTTAGTACTT AAACGTGGTG TCACTCTCGA T AC GAAAG GC 13 8 0 

T T T AC T C AG A CCGCGGGTTC CTCTGTTATT ATGGATGCGG G CAC AAC GT T AAAAGCAAGT 14 4 0 

ACAGAGGAGG TCACTTTAAC AGGTCTTTCC ATTCCTGTAG ACTCTTTAGG CGAGGGTAAG 15 00 

AAAGTT GTAA TTGCTGCTTC TGCAGCAAGT AAAAAT GT AG CCCTTAGTGG TCCGATTCTT 15 60 

CTTTTGGATA ACCAAGGGAA TGCTTATGAA AATCACGACT TAGGAAAAAC TCAAGACTTT 162 0 

TCATTTGTGC AGCTCTCTGC TCTGGGTACT G CAACAACT A CAGATGTTCC AGCGGTTCCT 168 0 

ACAGTAGCAA CTCCTACGCA CTATGGGTAT C AAGGT ACT T GGGGAATGAC TTGGGTTGAT 17 4 0 

GAT AC C G C AA GCACTCCAAA GAC T AAGAC A G C GAC AT TAG CTTGGACCAA T AC AG G C T AC 18 00 

CTTCCGAATC CTGAGCGTCA AGGACCTTTA GTT CCTAATA GCCTTTGGGG ATCTTTTTCA 18 60 

GACATCCAAG C GAT T C AAG G T GT CAT AGAG AGAAGTGCTT TGACTCTTTG TTCAGATCGA 192 0 

GGCTTCTGGG CTGCGGGAGT CGCCAATTTC T T AG AT AAAG ATAAGAAAGG GGAAAAACGC 19 80 

AAATAC C GT C ATAAATCTGG TGGATATGCT ATCGGAGGTG CAGC GCAAAC TTGTTCTGAA 2 04 0 

AACTTAATTA GCTTTGCCTT TTGCCAACTC TTTGGTAGCG AT AAAG AT T T CTTAGTCGCT 2100 

AAAAAT CAT A C T GAT AC C T A TGCAGGAGCC T T CT AT AT C C AAC AC AT T AC AGAATGTAGT 2160 

GGGTTCATAG GTTGTCTCTT AG AT AAAC T T CCTGGCTCTT G GAG T C AT AA ACCCCTCGTT 222 0 

TTAGAAGGGC AGCTCGCTTA TAGCCACGTC AGTAATGATC TGAAGACAAA GTATACTGCG 22 8 0 

TATCCTGAGG TGAAAGGTTC TTGGGGGAAT AATGCTTTTA A CAT GAT GTT GGGAGCTTCT 234 0 

TCTCATTCTT ATCCTGAATA CCTGCATTGT TTTGATACCT ATGCTCCATA CAT CAAACT G 24 00 

AATCTGACCT AT AT AC GT CA GGACAGCTTC TCGGAGAAAG GTACAGAAGG AAGATCTTTT 24 60 

GAT GAC AG C A ACCTCTTCAA TTTATCTTTG CCTATAGGGG TGAAGTTTGA GAAGTTCTCT 252 0 

GATTGTAATG ACTTTTCTTA TGATCTGACT TTATCCTATG TTCCTGATCT TATCCGCAAT 258 0 

GATCCCAAAT G CAC T AC AG C ACTTGTAATC AGCGGAGCCT CTTGGGAAAC TTATGCCAAT 2 64 0 

AACTTAGCAC GACAGGCCTT GCAAGTGCGT GCAGGCAGTC ACTACGCCTT CTCTCCTATG 27 0 0 

TTTGAAGTGC TCGGCCAGTT TGTCTTTGAA GTTCGTGGAT CCTCACGGAT TTATAATGTA 27 60 

GATCTTGGGG GTAAGTTCCA ATTCTAGGAG CGTCTCTCAT GTCTCAGAAA TTCTG 2 815 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Lys Ser Gin Phe Ser Trp Leu Val Leu Ser Ser Thr Leu Ala Cys 
15 10 15 

Phe Thr Ser Cys Ser Thr Val Phe Ala Ala Thr Ala Glu Asn lie Gly 
20 25 30 

Pro Ser Asp Ser Phe Asp Gly Ser Thr Asn Thr Gly Thr Tyr Thr Pro 
35 40 45 



Lys Asn Thr Thr Thr Gly lie Asp Tyr Thr Leu Thr Gly Asp lie Thr 
50 55 60 
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Leu Gin Asn Leu Gly Asp Ser Ala Ala Leu Thr Lys Gly Cys Phe Ser 
65 70 75 80 

Asp Thr Thr Glu Ser Leu Ser Phe Ala Gly Lys Gly Tyr Ser Leu Ser 
85 90 95 

Phe Leu Asn lie Lys Ser Ser Ala Glu Gly Ala Ala Leu Ser Val Thr 
100 105 110 

Thr Asp Lys Asn Leu Ser Leu Thr Gly Phe Ser Ser Leu Thr Phe Leu 
115 120 125 

Ala Ala Pro Ser Ser Val lie Thr Thr Pro Ser Gly Lys Gly Ala Val 
130 135 140 

Lys Cys Gly Gly Asp Leu Thr Phe Asp Asn Asn Gly Thr lie Leu Phe 
145 150 155 160 

Lys Gin Asp Tyr Cys Glu Glu Asn Gly Gly Ala lie Ser Thr Lys Asn 
165 170 175 

Leu Ser Leu Lys Asn Ser Thr Gly Ser lie Ser Phe Glu Gly Asn Lys 
180 185 190 

Ser Ser Ala Thr Gly Lys Lys Gly Gly Ala lie Cys Ala Thr Gly Thr 
195 200 205 

Val Asp lie Thr Asn Asn Thr Ala Pro Thr Leu Phe Ser Asn Asn lie 
210 215 220 

Ala Glu Ala Ala Gly Gly Ala lie Asn Ser Thr Gly Asn Cys Thr lie 
225 230 235 240 

Thr Gly Asn Thr Ser Leu Val Phe Ser Glu Asn Ser Val Thr Ala. Thr 
245 250 255 

Ala Gly Asn Gly Gly Ala Leu Ser Gly Asp Ala Asp Val Thr lie Ser 

260 265 270 

Gly Asn Gin Ser Val Thr Phe Ser Gly Asn Gin Ala Val Ala Asn Gly 
275 280 285 

Gly Ala lie Tyr Ala Lys Lys Leu Thr Leu Ala Ser Gly Gly Gly Gly 
290 295 300 

Gly lie Ser Phe Ser Asn Asn lie Val Gin Gly Thr Thr Ala Gly Asn 
305 ' 310 315 320 

Gly Gly Ala lie Ser lie Leu Ala Ala Gly Glu Cys Ser Leu Ser Ala 
325 330 335 

Glu Ala Gly Asp lie Thr Phe Asn Gly Asn Ala lie Val Ala Thr Thr 
340 345 350 



Pro Gin Thr Thr Lys Arg Asn Ser lie Asp lie Gly Ser Thr Ala Lys 
355 360 365 
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He Thr Asn Leu Arg Ala He Ser Gly His Ser He Phe Phe Tyr Asp 
370 375 380 

Pro He Thr Ala Asn Thr Ala Ala Asp Ser Thr Asp Thr Leu Asn Leu 
385 390 395 400 

Asn Lys Ala Asp Ala Gly Asn Ser Thr Asp Tyr Ser Gly Ser He Val 
405 410 415 

Phe Ser Gly Glu Lys Leu Ser Glu Asp Glu Ala Lys Val Ala Asp Asn 
420 425 430 

Leu Thr Ser Thr Leu Lys Gin Pro Val Thr Leu Thr Ala Gly Asn Leu 
435 440 445 

Val Leu Lys Arg Gly Val Thr Leu Asp Thr Lys Gly Phe Thr Gin Thr 
450 455 460 

Ala Gly Ser Ser Val He Met Asp Ala Gly Thr Thr Leu Lys Ala Ser 
465 470 475 480 

Thr Glu Glu Val Thr Leu Thr Gly Leu Ser He Pro Val Asp Ser Leu 
485 490 495 

Gly Glu Gly Lys Lys Val Val lie Ala Ala Ser Ala Ala Ser Lys Asn 
500 505 510 

Val Ala Leu Ser Gly Pro He Leu Leu Leu Asp Asn Gin Gly Asn Ala 
515 520 525 

Tyr Glu Asn His Asp Leu Gly Lys Thr Gin Asp Phe Ser Phe Val Gin 
530 535 540 

Leu Ser Ala Leu Gly Thr Ala Thr Thr Thr Asp Val Pro Ala Val Pro 
545 550 555 560 

Thr Val Ala Thr Pro Thr His Tyr Gly Tyr Gin Gly Thr Trp Gly Met 
565 570 575 

Thr Trp Val Asp Asp Thr Ala Ser Thr Pro Lys Thr Lys Thr Ala Thr 
580 585 590 

Leu Ala Trp Thr Asn Thr Gly Tyr Leu Pro Asn Pro Glu Arg Gin Gly 
595 600 605 

Pro Leu Val Pro Asn Ser Leu Trp Gly Ser Phe Ser Asp He Gin Ala 
610 615 620 

He Gin Gly Val He Glu Arg Ser Ala Leu Thr Leu Cys Ser Asp Arg 
625 630 635 640 

Gly Phe Trp Ala Ala Gly Val Ala Asn Phe Leu Asp Lys Asp Lys Lys 
645 650 655 

Gly Glu Lys Arg Lys Tyr Arg His Lys Ser Gly Gly Tyr Ala He Gly 
660 665 670 



Gly Ala Ala 



Gin Thr 



Cys Ser Glu Asn Leu He Ser Phe Ala Phe Cys 
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675 680 685 

Gin Leu Phe Gly Ser Asp Lys Asp Phe Leu Val Ala Lys Asn His Thr 
690 695 700 

Asp Thr Tyr Ala Gly Ala Phe Tyr lie Gin His lie Thr Glu Cys Ser 
705 710 715 720 

Gly Phe lie Gly Cys Leu Leu Asp Lys Leu Pro Gly Ser Trp Ser His 
725 730 735 

Lys Pro Leu Val Leu Glu Gly Gin Leu Ala Tyr Ser His Val Ser Asn 
740 745 750 

Asp Leu Lys Thr Lys Tyr Thr Ala Tyr Pro Glu Val Lys Gly Ser Trp 
755 760 765 

Gly Asn Asn Ala Phe Asn Met Met Leu Gly Ala Ser Ser His Ser Tyr 
770 775 780 

Pro Glu Tyr Leu His Cys Phe Asp Thr Tyr Ala Pro Tyr lie Lys Leu 
785 790 795 800 

Asn Leu Thr Tyr lie Arg Gin Asp Ser Phe Ser Glu Lys Gly Thr Glu 
805 810 815 

Gly Arg Ser Phe Asp Asp Ser Asn Leu Phe Asn Leu Ser Leu Pro lie 
820 825 830 

Gly Val Lys Phe Glu Lys Phe Ser Asp Cys Asn Asp Phe Ser Tyr Asp 
835 840 845 

Leu Thr Leu Ser Tyr Val Pro Asp Leu lie Arg Asn Asp Pro Lys Cys 
850 855 860 

Thr Thr Ala Leu Val lie Ser Gly Ala Ser Trp Glu Thr Tyr Ala Asn 
865 870 875 880 

Asn Leu Ala Arg Gin Ala Leu Gin Val Arg Ala Gly Ser His Tyr Ala 
885 890 895 

Phe Ser Pro Met Phe Glu Val Leu Gly Gin Phe Val Phe Glu Val Arg 
900 905 910 

Gly Ser Ser Arg lie Tyr Asn Val Asp Leu Gly Gly Lys Phe Gin Phe 
915 920 925 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3052 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
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ATGCGATTTT CGCTCTGCGG ATTTCCTCTA GTTTTTTCTT TAACATTGCT CTCAGTCTTC 60 

GACACTTCTT TGAGTGCTAC TACGATTTCT TTAACCCCAG AAGATAGTTT T CAT G GAGAT 12 0 

AGTCAGAATG CAGAACGTTC TTATAATGTT CAAGCTGGGG AT GT CT AT AG CCTTACTGGT 180 

GATGTCTCAA TATCTAACGT C GAT AAC T C T G CAT T AAAT A AAGCCTGCTT CAATGTGACC 24 0 

TCAGGAAGTG TGACGTTCGC AGGAAATCAT CATGGGTTAT AT T T T AAT AA TATTTCCTCA 300 

GGAACTACAA AGGAAGGGGC TGTACTTTGT TGCCAAGATC CTCAAGCAAC GGCACGTTTT 3 60 

TCTGGGTTCT CCACGCTCTC TTTTATTCAG AGCCCCGGAG ATATTAAAGA ACAGGGAT GT 42 0 

CTCTATTCAA AAAATGCACT TATGCTCTTA AAC AAT TAT G TAGTGCGTTT TGAACAAAAC 4 80 

CAAAGTAAGA CTAAAGGCGG AGCTATTAGT GGGGCGAATG TTACTATAGT AG G C AAC T AC 54 0 

GATTCCGTCT CTTTCTATCA GAATGCAGCC ACTTTTGGAG GTGCTATCCA TTCTTCAGGT 600 

CCCCTACAGA TTGCAGTAAA TCAGGCAGAG ATAAGATTTG CACAAAATAC TGCCAAGAAT 6 60 

GGTTCTGGAG GGGCTTTGTA CTCCGATGGT GATATTGATA TTGATCAGAA TGCTTATGTT 72 0 

CTATTTCGAG AAAATGAGGC ATTGACTACT GCTATAGGTA AGGGAGGGGC TGTCTGTTGT 7 80 

CTTCCCACTT CAGGAAGTAG TACTCCAGTT CCTATTGTGA CTTTCTCTGA CAATAAACAG 84 0 

TTAGTCTTTG AAAGAAAC CA T T C CAT AAT G GGT GGCGGAG CCATTTATGC TAGGAAACTT 900 

AGCATCTCTT C AG GAG GT C C TACTCTATTT AT C AAT AAT A TAT CAT AT G C AAATTCGCAA 9 60 

AATTTAGGTG GAGCTATTGC CATTGATACT G GAG G G GAGA TCAGTTTATC AG C AGAGAAA 102 0 

G GAACAAT T A CATTCCAAGG AAACCGGACG AGCTTACCGT TTTTGAATGG CATCCATCTT 108 0 

TTACAAAATG CTAAATTCCT GAAAT T AC AG GCGAGAAATG GATGCTCTAT AGAATTTTAT 114 0 

GATCCTATTA CTTCTGAAGC AGATGGGTCT AC C C AAT T GA AT AT C AAC G G AGAT C CT AAA 12 00 

AATAAAGAGT AC AC AG G GAC CATACTCTTT TCTGGAGAAA AGAGT CTAGC AAACGATCCT 1260 

AGGGATTTTA AAT C T AC AAT CCCTCAGAAC GTCAACCTGT C T G C AG GAT A CTTAGTTATT 132 0 

AAAGAGGGGG CCGAAGTCAC AGTTTCAAAA TTCACGCAGT CTCCAGGATC GCATTTAGTT 13 8 0 

T T AGAT T TAG GAACCAAACT GATAGCCTCT AAG GAAGAC A TTGCCATCAC AGGCCTCGCG 14 4 0 

ATAGATATAG AT AG C T T AAG CTCATCCTCA AC AG C AG C T G TTATTAAAGC AAACACCGCA 15 00 

AATAAACAGA TATCCGTGAC G GAC T C TATA GAACTTATCT CGCCTACTGG C AAT G C C TAT 15 60 

GAAGATCTCA GAAT GAGAAA T T C AC AG AC G TTCCCTCTGC TCTCTTTAGA GCCTGGAGCC 1620 

GGGGGTAGTG TGACTGTAAC T G CT G GAGAT TTCCTACCGG TAAGTCCCCA TTATGGTTTT 168 0 

CAAGGCAATT G GAAAT TAG C T T G GAC AG G A AC T G GAAAC A AAGTTGGAGA ATTCTTCTGG 17 4 0 

GATAAAATAA AT TAT AAG C C TAG AC C T GAA AAAGAAG GAA ATTTAGTTCC T AAT AT C T T G 18 00 

TGGGGGAATG CTGTAAATGT CAGATCCTTA ATGCAGGTTC AAGAGAC C C A TGCATCGAGC 18 60 

T T AC A GAC AG ATCGAGGGCT GT G GAT C GAT GGAATTGGGA ATTTCTTCCA TGTATCTGCC 1920 

T C C GAAGAC A AT AT AAG GTA CCGTCATAAC AG C GGT G GAT ATGTTCTATC T GT AAAT AAT 1980 

GAGATCACAC CTAAGCACTA TACT TC GAT G GCATTTTCCC AACTCTTTAG TAGAGACAAG 2 04 0 

GACTATGCGG TTTCCAACAA C GAAT AC AGA AT GT AT T TAG GAT C GT AT C T C TAT C AAT AT 2100 

ACAACCTCCC TAGGGAATAT TTTCCGTTAT GCTTCGCGTA ACCCTAATGT AAACGTCGGG 2160 

ATTCTCTCAA GAAGGTTTCT TCAAAATCCT CT T AT GAT T T TTCATTTTTT GTGTGCTTAT 2220 

GGTCATGCCA C C AAT GAT AT GAAAACAGAC T AC G C AAAT T TCCCTATGGT GAAAAACAGC 22 8 0 

T G GAGAAAC A ATTGTTGGGC TATAGAGTGC G GAG G GAG C A TGCCTCTATT G GT AT T T GAG 2 34 0 

AAC G GAAGAC TTTTCCAAGG TGCCATCCCA T T TAT GAAAC TACAATTAGT TTATGCTTAT 2 4 00 

CAGGGAGATT TCAAAGAGAC GACTGCAGAT GGCCGTAGAT TTAGTAATGG GAGTTTAACA 2 4 60 

TCGATTTCTG TACCTCTAGG CATACGCTTT GAGAAGCTGG CACTTTCTCA GGATGTACTC 2520 

TATGACTTTA GTTTCTCCTA TATTCCTGAT ATTTTCCGTA AGGATCCCTC ATGTGAAGCT 2580 

GCTCTGGTGA T TAG C G GAGA CTCCTGGCTT GTTCCGGCAG CACAC GT AT C AAGAC AT G CT 2 64 0 

TTTGTAGGGA GTGGAACGGG TCGGTATCAC T T T AAC GAC T ATACT GAG C T CTTATGTCGA 2700 

G GAAGT AT AG AATGCCGCCC CCATGCTAGG AATTATAATA TAAACTGTGG AAG C AAAT T T 27 60 

CGTTTTTAGA AGGTTT CCAT TGCCTGTGTG GTTCCGGATC T T AAC T AT AA ATCCTGGACT 2 82 0 

AT G GAT CAT A GGCATTGGGT TTCTCGAACT TGTGTGGAGA ATAACGACAT T T TAT AT G C A 2 8 80 

T AAC G GAAT A CTCGTATCAC CTCAGCCCCT AGAGAC AT T C TTTAGGGGTT CTTTATTTGT 2 94 0 

CTAAACTTCG TATTTTATCG AGAATCCTTT ACGTTCTTGG TTTGCTTGTC TCCGAGGAGT 3000 

TCTCTAACGA AT CAT AG G G A TTCCAGGGTT CTGTTCCTTG AGTCCTTTGG A 3052 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 922 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Arg Phe Ser Leu Cys Gly Phe Pro Leu Val Phe Ser Leu Thr Leu 
15 10 15 

Leu Ser Val Phe Asp Thr Ser Leu Ser Ala Thr Thr lie Ser Leu Thr 
20 25 30 

Pro Glu Asp Ser Phe His Gly Asp Ser Gin Asn Ala Glu Arg Ser Tyr 
35 40 45 

Asn Val Gin Ala Gly Asp Val Tyr Ser Leu Thr Gly Asp Val Ser lie 
50 55 60 

Ser Asn Val Asp Asn Ser Ala Leu Asn Lys Ala Cys Phe Asn Val Thr 
65 70 75 80 

Ser Gly Ser Val Thr Phe Ala Gly Asn His His Gly Leu Tyr Phe Asn 
85 90 95 

Asn lie Ser Ser Gly Thr Thr Lys Glu Gly Ala Val Leu Cys Cys Gin 
100 105 110 

Asp Pro Gin Ala Thr Ala Arg Phe Ser Gly Phe Ser Thr Leu Ser Phe 
115 120 125 

lie Gin Ser Pro Gly Asp lie Lys Glu Gin Gly Cys Leu Tyr Ser Lys 
130 135 140 

Asn Ala Leu Met Leu Leu Asn Asn Tyr Val Val Arg Phe Glu Gin Asn 
145 150 155 160 

Gin Ser Lys Thr Lys Gly Gly Ala lie Ser Gly Ala Asn Val Thr lie 
165 170 175 

Val Gly Asn Tyr Asp Ser Val Ser Phe Tyr Gin Asn Ala Ala Thr Phe 
180 185 190 

Gly Gly Ala lie His Ser Ser Gly Pro Leu Gin lie Ala Val Asn Gin 
195 200 205 

Ala Glu lie Arg Phe Ala Gin Asn Thr Ala Lys Asn Gly Ser Gly Gly 
210 215 220 

Ala Leu Tyr Ser Asp Gly Asp lie Asp lie Asp Gin Asn Ala Tyr Val 
225 230 235 240 

Leu Phe Arg Glu Asn Glu Ala Leu Thr Thr Ala lie Gly Lys Gly Gly 
245 250 255 



Ala Val Cys Cys Leu Pro Thr Ser Gly Ser Ser Thr Pro Val Pro lie 
260 265 270 



I 



50 



Val Thr Phe Ser 
275 

lie Met Gly Gly 
290 

Gly Gly Pro Thr 
305 

Asn Leu Gly Gly 



Ser Ala Glu Lys 
340 

Pro Phe Leu Asn 
355 

Leu Gin Ala Arg 
370 

Ser Glu Ala Asp 
385 

Asn Lys Glu Tyr 



Ala Asn Asp Pro 
420 

Leu Ser Ala Gly 
435 

Ser Lys Phe Thr 
450 

Thr Lys Leu lie 
465 

lie Asp lie Asp 



Ala Asn Thr Ala 
500 

lie Ser Pro Thr 
515 

Gin Thr Phe Pro 
530 

Thr Val Thr Ala 
545 

Gin Gly Asn Trp 



Glu Phe Phe Trp 
580 



Asp Asn Lys Gin 
280 

Gly Ala lie Tyr 
295 

Leu Phe lie Asn 
310 

Ala lie Ala lie 
325 

Gly Thr lie Thr 



Gly lie His Leu 
360 

Asn Gly Cys Ser 
375 

Gly Ser Thr Gin 
390 

Thr Gly Thr lie 
405 

Arg Asp Phe Lys 



Tyr Leu Val lie 
440 

Gin Ser Pro Gly 
455 

Ala Ser Lys Glu 
470 

Ser Leu Ser Ser 
485 

Asn Lys Gin lie 



Gly Asn Ala Tyr 
520 

Leu Leu Ser Leu 
535 

Gly Asp Phe Leu 
550 

Lys Leu Ala Trp 
565 

Asp Lys lie Asn 



Leu Val Phe Glu 



Ala Arg Lys Leu 
300 

Asn lie Ser Tyr 
315 

Asp Thr Gly Gly 
330 

Phe Gin Gly Asn 
345 

Leu Gin Asn Ala 



lie Glu Phe Tyr 
380 

Leu Asn lie Asn 
395 

Leu Phe Ser Gly 
410 

Ser Thr lie Pro 
425 

Lys Glu Gly Ala 



Ser His Leu Val 
460 

Asp lie Ala lie 
475 

Ser Ser Thr Ala 
490 

Ser Val Thr Asp 
505 

Glu Asp Leu Arg 



Glu Pro Gly Ala 
540 

Pro Val Ser Pro 
555 

Thr Gly Thr Gly 
570 

Tyr Lys Pro Arg 
585 



Arg Asn His Ser 
285 

Ser lie Ser Ser 

Ala Asn Ser Gin 
32 0 

Glu lie Ser Leu 
335 

Arg Thr Ser Leu 

350 

Lys Phe Leu Lys 
365 

Asp Pro lie Thr 



Gly Asp Pro Lys 
400 

Glu Lys Ser Leu 
415 

Gin Asn Val Asn 
430 

Glu Val Thr Val 
445 

Leu Asp Leu Gly 



Thr Gly Leu Ala 
480 

Ala Val lie Lys 
495 

Ser lie Glu Leu 
510 

Met Arg Asn Ser 
525 

Gly Gly Ser Val 



His Tyr Gly Phe 
560 

Asn Lys Val Gly 
575 

Pro Glu Lys Glu 
590 
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Gly Asn Leu Val Pro Asn lie Leu Trp Gly Asn Ala Val Asn Val Arg 
595 600 605 

Ser Leu Met Gin Val Gin Glu Thr His Ala Ser Ser Leu Gin Thr Asp 
610 615 620 

Arg Gly Leu Trp lie Asp Gly lie Gly Asn Phe Phe His Val Ser Ala 
625 630 635 640 

Ser Glu Asp Asn lie Arg Tyr Arg His Asn Ser Gly Gly Tyr Val Leu 
645 650 655 

Ser Val Asn Asn Glu lie Thr Pro Lys His Tyr Thr Ser Met Ala Phe 
660 665 670 

Ser Gin Leu Phe Ser Arg Asp Lys Asp Tyr Ala Val Ser Asn Asn Glu 
675 680 685 

Tyr Arg Met Tyr Leu Gly Ser Tyr Leu Tyr Gin Tyr Thr Thr Ser Leu 
690 695 700 

Gly Asn lie Phe Arg Tyr Ala Ser Arg Asn Pro Asn Val Asn Val Gly 
705 710 715 720 

lie Leu Ser Arg Arg Phe Leu Gin Asn Pro Leu Met lie Phe His Phe 
725 730 735 

Leu Cys Ala Tyr Gly His Ala Thr Asn Asp Met Lys Thr Asp Tyr Ala 
740 745 750 

Asn Phe Pro Met Val Lys Asn Ser Trp Arg Asn Asn Cys Trp Ala lie 
755 760 % 765 

Glu Cys Gly Gly Ser Met Pro Leu Leu Val Phe Glu Asn Gly Arg Leu 
770 775 780 

Phe Gin Gly Ala lie Pro Phe Met Lys Leu Gin Leu Val Tyr Ala Tyr 
785 790 795 800 

Gin Gly Asp Phe Lys Glu Thr Thr Ala Asp Gly Arg Arg Phe Ser Asn 
805 810 815 

Gly Ser Leu Thr Ser lie Ser Val Pro Leu Gly lie Arg Phe Glu Lys 
820 825 830 

Leu Ala Leu Ser Gin Asp Val Leu Tyr Asp Phe Ser Phe Ser Tyr lie 
835 840 845 

Pro Asp lie Phe Arg Lys Asp Pro Ser Cys Glu Ala Ala Leu Val lie 
850 . 855 860 

Ser Gly Asp Ser Trp Leu Val Pro Ala Ala His Val Ser Arg His Ala 
865 870 875 880 



Phe Val Gly Ser Gly Thr Gly Arg Tyr His Phe Asn Asp Tyr Thr Glu 
885 890 895 
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Leu Leu Cys Arg Gly Ser lie Glu Cys Arg Pro His Ala Arg Asn Tyr 
900 905 910 

Asn lie Asn Cys Gly Ser Lys Phe Arg Phe 
915 920 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

AT GAAGAT T C CACTCCGCTT T T TAT T GAT A TCATTAGTAC CTACGCTTTC TATGTCGAAT 60 

T TAT TAG GAG CTGCTACTAC CGAAGAGCTA TCGGCTAGCA ATAGCTTCGA TGGAACTACA 12 0 

TCAACAACAA GCTTTTCTAG TAAAAC AT C A TCGGCTACAG AT G G C AC C AA TTATGTTTTT 18 0 

AAAGATTCTG TAG T TAT AG A AAATGTACCC AAAACAGGGG AAACTCAGTC TACTAGTTGT 24 0 

TTTAAAAATG ACGCTGCAGC TGGAGATCTA AATTT CTTAG GAGGGGGATT TTCTTTCACA 300 

T T TAG C AAT A TCGATGCAAC CACGGCTTCT GGAGCTGCTA TTGGAAGTGA AGCAGCTAAT 3 60 

AAGACAGTCA CGTTATCAGG ATTTTCGGCA CTTTCTTTTC TTAAATCCCC AGCAAGTACA 42 0 

GTGACTAATG GATTGGGAGC TAT C AAT GT T AAAGGGAATT TAAGCCTATT G GAT AAT GAT 4 80 

AAGGTATTGA T T C AG GAC AA TTTCTCAACA GGAGATGGCG GAG C AAT T AA TTGTGCAGGC 54 0 

TCCTTGAAGA TCGCAAACAA TAAGTCCCTT TCTTTTATTG GAAATAGTTC TTCAACACGT 600 

GGCGGAGCGA T T CAT AC C AA AAACCTCACA CTATCTTCTG GTGGGGAAAC TCTATTTCAG 660 

G G GAAT AC AG CGCCTACGGC TGCTGGTAAA GGAGGTGCTA TCGCGATTGC AGACTCTGGC 72 0 

ACCCTATCCA TTTCTGGAGA CAGTGGCGAC ATTATCTTTG AAGGCAATAC GAT AG GAG C T 780 

AC AG GAAC C G TCTCTCATAG TGCTATTGAT TTAGGAACTA GCGCTAAGAT AACTGCGTTA 84 0 

CGTGCTGCGC AAG GAC AT AC GAT AT AC T T T TAT GAT C C GA TTACTGTAAC AGGATCGACA 90 0 

TCTGTTGCTG ATGCTCTCAA TAT T AAT AG C CCTGATACTG GAGATAACAA AGAGT AT AC G 9 60 

G GAAC CAT AG TCTTTTCTGG AGAGAAG C T C AC G GAG G C AG AAGCTAAAGA TGAGAAGAAC 102 0 

CGCACTTCTA AAT T AC T T C A AAATGTTGCT TTTAAAAATG GGACTGTAGT TTTAAAAGGT 108 0 

GATGTCGTTT TAAGTGCGAA CGGTTTCTCT C AG GAT G C AA ACTCTAAGTT GAT TAT G GAT 114 0 

TTAGGGACGT CGTTGGTTGC AAACACCGAA AGTATCGAGT TAACGAATTT GGAAATTAAT 12 00 

ATAGACTCTC TCAGGAACGG GAAAAAGATA AAACT CAGT G CTGCCACAGC TCAGAAAGAT 12 60 

ATTCGTATAG ATCGTCCTGT TGTACTGGCA AT TAG C GAT G AGAGTTTTTA TCAAAATGGC 132 0 

TTTTTGAATG AG GAC CAT T C CTATGATGGG ATTCTTGAGT TAGATGCTGG GAAAGAC AT C 13 8 0 

GTGATTTCTG CAGATTCTCG CAGTATAAAT GCT GTACAAT CTCCGTATGG CTATCAGGGA 14 4 0 

AAGTGGACAA TCAATTGGTC TACTGATGAT AAGAAAGCTA CGGTTTCTTG GGCAAAGCAA 15 00 

AGTTTTAATC CCACTGCTGA GCAGGAGGCT CCGTTAGTTC CTAATCTTCT TTGGGGTTCT 1560 

T T TAT AG AT G TTCGTCCCTT CCAAAATTTT ATAGAGCTAG GTACTGAAGG TGCTCCTTAC 162 0 

GAAAAGAGAT TTTGGGTTGC AGGCATTTCC AATGTTTTGC ATAGGAGCGG TCGTGAAAAT 168 0 

CAAAG GAAAT TCCGTCATGT GAGT GGAGGT GCTGTAGTAG GT G C TAG C AC GAGGATGCCG 17 4 0 

GGTGGTGATA CCTTGTCTCT GGGTTTTGCT CAGCTCTTTG C G C GT GAC AA AGACTACTTT 18 00 

AT GAAT AC C A ATTTCGCAAA GAC C T AC G C A GGATCTTTAC GTTTGCAGCA CGATGCTTCC 18 60 

CTATACTCTG TGGTGAGTAT CCTTTTAGGA GAG GGAG GAC TCCGCGAGAT CCTGTTGCCT 192 0 

TATGTTTCCA AGACTCTGCC GTGCTCTTTC TATGGGCAGC TTAGCTACGG C CAT AC G GAT 198 0 

CAT C G CAT G A AGACCGAGTC TCTACCCCCC CCCCCCCCGA CGCTCTCGAC G GAT CAT AC T 2 04 0 

TCTTGGGGAG GATATGTCTG GGCTGGAGAG CTGGGAACTC GAGTTGCTGT T GAAAAT AC C 2100 

AGCGGCAGAG GATTTTTCCG AGAGTACACT CCATTTGTAA AAGTCCAAGC TGTTTACTCG 2160 

CGCCAAGATA GCTTTGTTGA ACTAGGAGCT AT CAGT C GT G ATTTTAGTGA TTCGCATCTT 222 0 

TATAACCTTG CGATTCCTCT T G GAAT C AAG TTAGAGAAAC GGTTTGCAGA G C AAT AT TAT 22 8 0 

CATGTTGTAG C GAT GT AT T C TCCAGATGTT TGTCGTAGTA ACCCCAAATG T AC GAC T AC C 2 34 0 

CTACTTTCCA ACCAAGGGAG TTGGAAGACC AAAGGTTCGA ACTTAGCAAG ACAGGCTGGT 24 00 
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ATTGTTCAGG CCTCAGGTTT TCGATCTTTG GGAGCT GCAG CAGAGCTTTT CGGGAACTTT 2 4 60 
GGCTTTGAAT GGCGGGGATC TTCTCGTAGC TATAATGTAG ATGCGGGTAG CAAAATCAAA 252 0 
TTTTAG 2 52 6 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Lys lie Pro Leu Arg Phe Leu Leu lie Ser Leu Val Pro Thr Leu 
15 10 15 

Ser Met Ser Asn Leu Leu Gly Ala Ala Thr Thr Glu Glu Leu Ser Ala 
20 25 30 

Ser Asn Ser Phe Asp Gly Thr Thr Ser Thr Thr Ser Phe Ser Ser Lys 
35 40 45 

Thr Ser Ser Ala Thr Asp Gly Thr Asn Tyr Val Phe Lys Asp Ser Val 
50 55 60 

Val lie Glu Asn Val Pro Lys Thr Gly Glu Thr Gin Ser Thr Ser Cys 
65 70 75 80 

Phe Lys Asn Asp Ala Ala Ala Gly Asp Leu Asn Phe Leu Gly Gly Gly 
85 90 95 

Phe Ser Phe Thr Phe Ser Asn lie Asp Ala Thr Thr Ala Ser Gly Ala 
100 105 110 

Ala lie Gly Ser Glu Ala Ala Asn Lys Thr Val Thr Leu Ser Gly Phe 
115 120 125 

Ser Ala Leu Ser Phe Leu Lys Ser Pro Ala Ser Thr Val Thr Asn Gly 
130 135 140 

Leu Gly Ala lie Asn Val Lys Gly Asn Leu Ser Leu Leu Asp Asn Asp 
145 150 155 160 

Lys Val Leu lie Gin Asp Asn Phe Ser Thr Gly Asp Gly Gly Ala lie 
165 170 175 

Asn Cys Ala Gly Ser Leu Lys lie Ala Asn Asn Lys Ser Leu Ser Phe 
180 185 190 

lie Gly Asn Ser Ser Ser Thr Arg Gly Gly Ala lie His Thr Lys Asn 
195 200 205 



Leu Thr Leu Ser Ser Gly Gly Glu Thr Leu Phe Gin Gly Asn Thr Ala 
210 215 220 



Gly Lys 
230 

Ser Gly 
245 

Thr Gly 

lie Thr 

Pro lie 

As n Ser 
310 

Phe Ser 
325 

Arg Thr 



Val Leu 



Ala Asn 

Thr Glu 
390 

Arg Asn 
405 

lie Arg 
r Gin 
u Leu 



Gly Gly 

Asp Ser 

Thr Val 

Ala Leu 
280 

Thr Val 
295 

Pro Asp 
Gly Glu 
Ser Lys 

Lys Gly 

360 

Ser Lys 
375 

Ser lie 

Gly Lys 

lie Asp 

Asn Gly 
440 

Asp Ala 
455 



Ala lie 

Gly Asp 
250 

Ser His 
265 

Arg Ala 

Thr Gly 

Thr Gly 

Lys Leu 
330 

Leu Leu 
345 

Asp Val 

Leu lie 

Glu Leu 

Lys lie 
410 

Arg Pro 
425 

Phe Leu 



Gly Lys 



Ala lie 

235 

lie lie 

Ser Ala 

Ala Gin 

Ser Thr 
300 

Asp Asn 
315 

Thr Glu 

Gin Asn 

Val Leu 

Met Asp 
380 

Thr Asn 
395 

Lys Leu 

Val Val 

Asn Glu 

Asp lie 
460 



Ala Asp 

Phe Glu 

lie Asp 
270 

Gly His 
285 

Ser Val 

Lys Glu 

Ala Glu 

Val Ala 
350 

Ser Ala 
365 

Leu Gly 
Leu Glu 



Ser Ala 



Leu Ala 
430 

Asp His 
445 

Val lie 



Ser Gly 
240 

Gly Asn 
255 

Leu Gly 
Thr lie 
Ala Asp 

Tyr Thr 

320 

Ala Lys 
335 

Phe Lys 

Asn Gly 

Thr Ser 

lie Asn 
400 

Ala Thr 
415 

lie Ser 
Ser Tyr 
Ser Ala 



Pro Thr Ala Ala 
225 

Thr Leu Ser lie 



Thr lie Gly Ala 
260 

Thr Ser Ala Lys 
275 

Tyr Phe Tyr Asp 
290 

Ala Leu Asn lie 
305 

Gly Thr lie Val 



Asp Glu Lys Asn 
340 

Asn Gly Thr Val 
355 

Phe Ser Gin Asp 
370 

Leu Val Ala Asn 
385 

lie Asp Ser Leu 



Ala Gin Lys Asp 
420 

Asp Glu Ser Phe 
435 

Asp Gly lie Leu 
450 

Asp Ser Arg Ser 
465 

Lys Trp Thr lie 



lie Asn Ala Val 
470 

Asn Trp Ser Thr 
485 



Gin Ser Pro Tyr 
475 

Asp Asp Lys Lys 
490 



Gly Tyr Gin Gly 
480 

Ala Thr Val Ser 
495 



Trp Ala Lys Gin Ser Phe Asn Pro Thr Ala Glu Gin Glu Ala Pro Leu 

500 505 510 

Val Pro Asn Leu Leu Trp Gly Ser Phe lie Asp Val Arg Pro Phe Gin 

515 520 525 

Asn Phe lie Glu Leu Gly Thr Glu Gly Ala Pro Tyr Glu Lys Arg Phe 
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530 535 540 

Trp Val Ala Gly lie Ser Asn Val Leu His Arg Ser Gly Arg Glu Asn 
545 550 ' 555 560 

Gin Arg Lys Phe Arg His Val Ser Gly Gly Ala Val Val Gly Ala Ser 
565 570 575 

Thr Arg Met Pro Gly Gly Asp Thr Leu Ser Leu Gly Phe Ala Gin Leu 

580 585 590 

Phe Ala Arg Asp Lys Asp Tyr Phe Met Asn Thr Asn Phe Ala Lys Thr 
595 600 605 

Tyr Ala Gly Ser Leu Arg Leu Gin His Asp Ala Ser Leu Tyr Ser Val 
610 615 620 

Val Ser lie Leu Leu Gly Glu Gly Gly Leu Arg Glu lie Leu Leu Pro 
625 630 635 640 

Tyr Val Ser Lys Thr Leu Pro Cys Ser Phe Tyr Gly Gin Leu Ser Tyr 
645 650 655 

Gly His Thr Asp His Arg Met Lys Thr Glu Ser Leu Pro Pro Pro Pro 
660 665 670 

Pro Thr Leu Ser Thr Asp His Thr Ser Trp Gly Gly Tyr Val Trp Ala 
675 680 685 

Gly Glu Leu Gly Thr Arg Val Ala Val Glu Asn Thr Ser Gly Arg Gly 
690 695 700 

Phe Phe Arg Glu Tyr Thr Pro Phe Val Lys Val Gin Ala Val Tyr Ser 
705 710 715 720 

Arg Gin Asp Ser Phe Val Glu Leu Gly Ala lie Ser Arg Asp Phe Ser 
725 730 735 

Asp Ser His Leu Tyr Asn Leu Ala lie Pro Leu Gly lie Lys Leu Glu 
740 745 750 

Lys Arg Phe Ala Glu Gin Tyr Tyr His Val Val Ala Met Tyr Ser Pro 
755 760 765 

Asp Val Cys Arg Ser Asn Pro Lys Cys Thr Thr Thr Leu Leu Ser Asn 
770 775 780 

Gin Gly Ser Trp Lys Thr Lys Gly Ser Asn Leu Ala Arg Gin Ala Gly 
785 790 795 800 

lie Val Gin Ala Ser Gly Phe Arg Ser Leu Gly Ala Ala Ala Glu Leu 
805 810 815 

Phe Gly Asn Phe Gly Phe Glu Trp Arg Gly Ser Ser Arg Ser Tyr Asn 
820 825 830 



Val Asp Ala Gly Ser Lys lie Lys Phe 
835 840 
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(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

ATGAAGTCTT CTTTCCCCAA GTTTGTATTT TCTACATTTG CTATTTTCCC TTTGTCTATG 60 

ATTGCTACCG AGACAGTTTT GGATTCAAGT GCGAGTTTCG ATGGGAATAA AAAT GGTAAT 12 0 

TTTTCAGTTC GTGAGAGTCA GGAAGATGCT GGAACTACCT AC C TAT T T AA GGGAAATGTC 180 

ACTCTAGAAA ATATTCCTGG AACAGGCACA GCAATCACAA AAAGCTGTTT TAACAACACT 24 0 

AAGGGCGATT TGACTTTCAC AGGTAACGGG AACTCTCTAT TGTTCCAAAC GGTGGATGCA 3 00 

GGGACT GTAG CAGGGGCTGC TGTTAACAGC AGCGTGGTAG AT AAAT CTAC C AC GT T TATA 3 60 

GGGTTTTCTT CGCTATCTTT TATTGCGTCT CCTGGAAGTT CGATAACTAC CGGCAAAGGA 42 0 

GCCGTTAGCT GCTCTACGGG TAG C T T GAAG TTTGACAAAA ATGTCAGTTT GCTCTTCAGC 4 80 

AAAAACTTTT C AAC G G AT AA TGGCGGTGCT AT C AC C G C AA AAACTCTTTC ATTAACAGGG 54 0 

ACTACAAT GT CAGCTCTGTT TTCTGAAAAT ACCTCCTCAA AGAAAGGCGG AG C CAT T C AG 6 00 

ACTTCCGATG CCCTTACCAT TACTGGAAAC CAAGGGGAAG TCTCTTTTTC TGACAATACT 6 60 

TCTTCGGATT CTGGAGCTGC AATTTTTACA GAAGCCTCGG TGACTATTTC TAATAATGCT 72 0 

AAAGTTTCCT TTATTGACAA TAAGGT CACA GGAGCGAGCT CCTCAACAAC GGGGGATATG 7 80 

TCAGGAGGTG CTATCTGTGC TTATAAAACT AGTACAGATA CTAAGGTCAC CCTCACTGGA 84 0 

AATCAGATGT TACTCTTCAG CAACAATACA TCGACAACAG CGGGAGGAGC TATCTATGTG 9 00 

AAAAAGCTCG AACTGGCTTC CGGAGGACTT ACCCTATTCA GTAGAAATAG T GT C AAT G GA 9.60 

GGTACAGCTC CTAAAGGTGG AGCCATAGCT AT C GAAG AT A GTGGGGAATT GAGTTTATCC 102 0 

GCCGATAGTG GTGACATTGT CTTTTTAGGG AATACAGTCA CTTCTACTAC TCCTGGGACG 10 80 

AATAGAAGTA GTATCGACTT AG GAAC GAG T GCAAAGATGA CAGCTTTGCG TTCTGCTGCT 114 0 

GGTAGAGCCA TCTACTTCTA TGATCCCATA ACTACAGGAT CTTCCACAAC AG T T AC AG AT 12 00 

GTCTTAAAAG TTAATGAGAC TCCGGCAGAT TCTGCACTAC AAT AT AC AG G GAAC AT CAT C 12 60 

T T C AC AG GAG AAAAGTTATC AGAGACAGAG GCCGCAGATT CTAAAAATCT T AC T T C GAAG 132 0 

CTACTACAGC CTGTAACTCT TTCAGGAGGT ACTCTATCTT TAAAACATGG AGTGACTCTG 13 8 0 

CAGACTCAGG CAT T C AC T C A ACAGGCAGAT TCTCGTCTCG AAATGGACGT AGGAACTACT 14 4 0 

CTAGAACCTG C T GAT AC TAG CACCATAAAC AATTTGGTCA T T AAC AT C AG TTCTATAGAC 15 00 

GGT GCAAAGA AG G C AAAAAT AGAAAC CAAA GCTACGTCAA AAAATCTGAC TTTATCTGGA 15 60 

AC CATC ACT T TATTGGACCC GACGGGCACG TTTTATGAAA AT CAT AGT T T AAGAAATCCT 162 0 

CAGTCCTACG ACATCTTAGA GCTCAAAGCT TCTGGAACTG TAACAAGCAC CGCAGTGACT 168 0 

CCAGATCCTA TAATGGGTGA GAAAT T C CAT TACGGCTATC AGGGAACTTG GGGCCCAATT 17 4 0 

GTTTGGGGGA CAGGGGCTTC T AC G AC T G C A ACCTTCAACT GGACTAAAAC TGGCTATATT 18 00 

CCTAATCCCG AGCGTATCGG CTCTTTAGTC C C T AAT AG C T TAT G G AAT G C ATTTATAGAT 18 60 

ATTAGCTCTC TCCATTATCT TATGGAGACT GCAAACGAAG GGTTGCAGGG AGACCGTGCT 192 0 

TTTTGGTGTG CTGGATTATC TAACTTCTTC CATAAGGATA GTACAAAAAC ACGACGCGGG 19 8 0 

TTTCGCCATT TGAGTGGCGG T TAT GT CAT A G GAG GAAAC C TACATACTTG TTCAGATAAG 2 04 0 

ATTCTTAGTG CTGCATTTTG TCAGCTCTTT G GAAGAGAT A GAGACTACTT TGTAGCTAAG 2100 

AATCAAGGTA CAGTCTACGG AGGAACTCTC TAT T AC C AG C ACAAC GAAAC CTATATCTCT 2160 

CTTCCTTGCA AACTACGGCC TTGTTCGTTG TCTTATGTTC C T AC AGAGAT TCCTGTTCTC 222 0 

TTTTCAGGAA ACCTTAGCTA CACCCATACG GAT AAC GAT C T GAAAAC CAA GTATACAACA 22 8 0 

TATCCTACTG TTAAAGGAAG CTGGGGGAAT GAT AGT T T C G CTTTAGAATT CGGTGGAAGA 234 0 

GCTCCGATTT GCTTAGATGA AAGTGCTCTA T T T GAG C AGT ACATGCCCTT CAT GAAAT T G 24 00 

CAGTTTGTCT AT G C AC AT C A GGAAGGTTTT AAAGAAC AG G GAACAGAAGC TCGTGAATTT 24 60 

GGAAGTAGCC GTCTTGTGAA TCTTGCCTTA CCTATCGGGA TCCGATTTGA TAAGGAATCA 2 52 0 

GACTGCCAAG AT G C AAC GT A CAATCTAACT CTT GGT TATA CTGTGGATCT TGTTCGTAGT 258 0 

AACCCCGACT GTACGACAAC ACTGCGAATT AGCGGTGATT CTTGGAAAAC CTTCGGTACG 2 64 0 

AATTTGGCAA GACAAGCTTT AGTCCTTCGT GCAGGGAACC ATTTTTGCTT T AAC T C AAAT 27 00 
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TTT GAAGCCT TTAGCCAATT TTCTTTTGAA TTGCGTGGGT CATCTCGCAA TTACAATGTA 27 60 
GACTTAGGAG CAAAATACCA ATTCTAA 27 87 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 928 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Lys Ser Ser Phe Pro Lys Phe Val Phe Ser Thr Phe Ala lie Phe 
15 10 15 

Pro Leu Ser Met lie Ala Thr Glu Thr Val Leu Asp Ser Ser Ala Ser 
20 25 30 

Phe Asp Gly Asn Lys Asn Gly Asn Phe Ser Val Arg Glu Ser Gin Glu 
35 40 45 

Asp Ala Gly Thr Thr Tyr Leu Phe Lys Gly Asn Val Thr Leu Glu Asn 
50 55 60 

lie Pro Gly Thr Gly Thr Ala lie Thr Lys Ser Cys Phe Asn Asn Thr 
65 70 75 80 

Lys Gly Asp Leu Thr Phe Thr Gly Asn Gly Asn Ser Leu Leu Phe Gin 
85 90 95 

Thr Val Asp Ala Gly Thr Val Ala Gly Ala Ala Val Asn Ser Ser Val 
100 105 110 

Val Asp Lys Ser Thr Thr Phe lie Gly Phe Ser Ser Leu Ser Phe lie 
115 120 125 

Ala Ser Pro Gly Ser Ser lie Thr Thr Gly Lys Gly Ala Val Ser Cys 
130 135 140 

Ser Thr Gly Ser Leu Lys Phe Asp Lys Asn Val Ser Leu Leu Phe Ser 
145 150 155 160 

Lys Asn Phe Ser Thr Asp Asn Gly Gly Ala lie Thr Ala Lys Thr Leu 
165 170 175 

Ser Leu Thr Gly Thr Thr Met Ser Ala Leu Phe Ser Glu Asn Thr Ser 
180 185 190 

Ser Lys Lys Gly Gly Ala lie Gin Thr Ser Asp Ala Leu Thr lie Thr 
195 200 205 

Gly Asn Gin Gly Glu Val Ser Phe Ser Asp Asn Thr Ser Ser Asp Ser 
210 215 220 



Gly Ala Ala lie Phe Thr Glu Ala Ser Val Thr lie Ser Asn Asn Ala 
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225 

Lys Val 

Thr Gly 

Asp Thr 

Asn Thr 
290 

Leu Ala 
305 

Gly Thr 

Leu Ser 

Val Thr 

Thr Ser 
370 

Tyr Phe 

385 

Val Leu 

Gly Asn 

As p Ser 

Gly Gly 
450 

Phe Thr 
465 

Leu Glu 

Ser Ser 

Ser Lys 

Gly Thr 
530 



Ser Phe 

Asp Met 
260 

Lys Val 
275 

Ser Thr 

Ser Gly 

Ala Pro 

Leu Ser 
340 

Ser Thr 
355 

Ala Lys 

Tyr Asp 

Lys Val 

lie lie 
420 

Lys Asn 
435 

Thr Leu 

Gin Gin 

Pro Ala 

lie Asp 
500 

Asn Leu 
515 

Phe Tyr 



230 

lie Asp 
245 

Ser Gly 

Thr Leu 

Thr Ala 

Gly Leu 
310 

Lys Gly 
325 

Ala Asp 
Thr Pro 



Met Thr 

Pro lie 

390 

Asn Glu 
405 

Phe Thr 

Leu Thr 

Ser Leu 

Ala Asp 
470 

Asp Thr 
485 

Gly Ala 
Thr Leu 
Glu Asn 



Asn Lys 

Gly Ala 

Thr Gly 
280 

Gly Gly 
295 

Thr Leu 

Gly Ala 

Ser Gly 

Gly Thr 
360 

Ala Leu 
375 

Thr Thr 

Thr Pro 

Gly Glu 

Ser Lys 
440 

Lys His 
455 

Ser Arg 

Ser Thr 

Lys Lys 

Ser Gly 
520 

His Ser 
535 



Val Thr 
250 

lie Cys 
'265 

Asn Gin 

Ala lie 

Phe Ser 

lie Ala 

330 

Asp lie 
345 

Asn Arg 

Arg Ser 

Gly Ser 

Ala Asp 
410 

Lys Leu 
425 

Leu Leu 

Gly Val 

Leu Glu 

lie Asn 
490 

Ala Lys 
505 

Thr lie 
Leu Arg 



235 

Gly Ala 

Ala Tyr 

Met Leu 

Tyr Val 
300 

Arg Asn 
315 

lie Glu 

Val Phe 

Ser Ser 

Ala Ala 
380 

Ser Thr 
395 

Ser Ala 
Ser Glu 



Gin Pro 

Thr Leu 
460 

Met Asp 
475 

Asn Leu 

lie Glu 

Thr Leu 

Asn Pro 
540 



240 

Ser Ser Ser Thr 
255 

Lys Thr Ser Thr 
270 

Leu Phe Ser Asn 
285 

Lys Lys Leu Glu 



Ser Val Asn Gly 
320 

Asp Ser Gly Glu 
335 

Leu Gly Asn Thr 
350 

lie Asp Leu Gly 
365 

Gly Arg Ala lie 



Thr Val Thr Asp 
400 

Leu Gin Tyr Thr 
415 

Thr Glu Ala Ala 
430 

Val Thr Leu Ser 
445 

Gin Thr Gin Ala 



Val Gly Thr Thr 
480 

Val lie Asn lie 
495 

Thr Lys Ala Thr 
510 

Leu Asp Pro Thr 
525 

Gin Ser Tyr Asp 
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lie Leu Glu Leu Lys Ala Ser Gly Thr Val Thr Ser Thr Ala Val Thr 
545 550 555 560 

Pro Asp Pro lie Met Gly Glu Lys Phe His Tyr Gly Tyr Gin Gly Thr 
565 570 575 

Trp Gly Pro He Val Trp Gly Thr Gly Ala Ser Thr Thr Ala Thr Phe 
580 585 590 

Asn Trp Thr Lys Thr Gly Tyr He Pro Asn Pro Glu Arg He Gly Ser 
595 600 605 

Leu Val Pro Asn Ser Leu Trp Asn Ala Phe He Asp He Ser Ser Leu 
610 615 620 

His Tyr Leu Met Glu Thr Ala Asn Glu Gly Leu Gin Gly Asp Arg Ala 
625 630 635 640 

Phe Trp Cys Ala Gly Leu Ser Asn Phe Phe His Lys Asp Ser Thr Lys 
645 650 655 

Thr Arg Arg Gly Phe Arg His Leu Ser Gly Gly Tyr Val He Gly Gly 
660 665 670 

Asn Leu His Thr Cys Ser Asp Lys He Leu Ser Ala Ala Phe Cys Gin 
675 680 685 

Leu Phe Gly Arg Asp Arg Asp Tyr Phe Val Ala Lys Asn Gin Gly Thr 
690 695 700 

Val Tyr Gly Gly Thr Leu Tyr Tyr Gin His Asn Glu Thr Tyr lie Ser 
705 710 715 720 

Leu Pro Cys Lys Leu Arg Pro Cys Ser Leu Ser Tyr Val Pro Thr Glu 
725 730 735 

He Pro Val Leu Phe Ser Gly Asn Leu Ser Tyr Thr His Thr Asp Asn 
740 745 750 

Asp Leu Lys Thr Lys Tyr Thr Thr Tyr Pro Thr Val Lys Gly Ser Trp 
755 760 765 

Gly Asn Asp Ser Phe Ala Leu Glu Phe Gly Gly Arg Ala Pro He Cys 
770 775 780 

Leu Asp Glu Ser Ala Leu Phe Glu Gin Tyr Met Pro Phe Met Lys Leu 
785 790 795 800 

Gin Phe Val Tyr Ala His Gin Glu Gly Phe Lys Glu Gin Gly Thr Glu 
805 810 815 

Ala Arg Glu Phe Gly Ser Ser Arg Leu Val Asn Leu Ala Leu Pro lie 
820 825 830 

Gly He Arg Phe Asp Lys Glu Ser Asp Cys Gin Asp Ala Thr Tyr Asn 
835 840 845 



60 



Leu Thr Leu Gly Tyr Thr Val Asp Leu Val Arg Ser Asn Pro Asp Cys 

850 855 860 

Thr Thr Thr Leu Arg lie Ser Gly Asp Ser Trp Lys Thr Phe Gly Thr 

865 870 875 880 

Asn Leu Ala Arg Gin Ala Leu Val Leu Arg Ala Gly Asn His Phe Cys 

885 890 895 

Phe Asn Ser Asn Phe Glu Ala Phe Ser Gin Phe Ser Phe Glu Leu Arg 



Gly Ser Ser Arg Asn Tyr Asn Val Asp Leu Gly Ala Lys Tyr Gin Phe 
915 920 925 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2757 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATGAGATCGT CTTTTTCCTT GTTATTAATA TCTTCATCTC TAGCCTTTCC TCTCTTAATG 60 

AGTGTTTCTG CAGATGCTGC C GAT C T C AC A TTAGGGAGTC GTGACAGTTA TAATGGTGAT 12 0 

ACAAGCACCA CAGAATTTAC TCCTAAAGCG GCAACTTCTG ATGCTAGTGG CACGACCTAT 180 

ATTCTCGATG GGGATGTCTC GATAAGCCAA GCAGGGAAAC AAACGAGCTT AACCACAAGT 24 0 

TGTTTTTCTA AC AC T G C AG G AAATCTTACC TTCTTAGGGA ACGGATTTTC TCTTCATTTT 300 

GACAAT AT T A TTTCGTCTAC TGTTGCAGGT GTTGTTGTTA GCAATACAGC AGCTTCTGGG 360 

AT T AC G AAAT TCTCAGGATT TTCAACTCTT CGGATGCTTG CAGCTCCTAG GACCACAGGT 42 0 

AAAGGAGC CA TTAAAATTAC CGATGGTCTG GTGTTTGAGA GTATAGGGAA TCTTGACCAA 4 80 

AATGAAAATG CCTCTAGTGA AAAT GGGGGA G C CAT C AAT A CGAAGACTTT GTCTTTGACT 54 0 

GGGAGTACGC GGTTTGTAGC GTTCCTTGGC AATAGCTCGT CGCAACAAGG GGGAGCGATC 600 

TATGCTTCTG GTGACTCTGT GATTTCTGAG AATGCAGGAA TCTTGAGCTT C G GAAAC AAC 660 

AGTGCGACAA CATCAGGAGG CGCGATCTCT GCTGAAGGGA AC CT T GT GAT C T C C AAT AAC 72 0 

CAAAATATCT TTTTCGATGG CT GCAAAGCA AC T AC AAAT G GCGGAGCTAT T GAT T GT AAC 7 80 

AAAGCAGGGG CGAACCCAGA CCCTATCTTG ACTCTTTCAG GAAAT GAGAG CCTGCATTTT 84 0 

CT GAATAACA CAGCAGGAAA TAGTGGAGGT G C GAT T TATA CCAAAAAATT GGTGTTATCC 900 

TCAGGACGAG GAGGAGTGTT ATTTTCTAAC AACAAAGCTG CGAATGCTAC TCCTAAAGGA 960 

GGGGCAATTG C GAT T C TAG A TTCTG GAGAG AT TAG CAT T T CT GCAGAT CT C G G C AAT AT C 1020 

ATTTTCGAGG GCAATACTAC GAGCACTACA GGAAGTCCTG CGAGTGTGAC C AGAAAT GC T 108 0 

ATAGATCTTG CAT C GAAT GC AAAATTTTTA AAT CT C C GAG CGACTCGGGG AAAT AAAGT T 114 0 

ATTTTCTATG ATCCTATCAC GAGCT CAGGA GCTACTGATA AGCTCTCTTT GAATAAAGCT 12 00 

GACGCAGGAT C T G GAAAT AC CTATGAAGGC TACATCGTTT TCTCTGGAGA GAAACTCTCA 12 60 

GAAGAG GAAC TTAAGAAACC T GACAAT CT G AAGTCTACAT T T AC AC AG G C TGTAGAGCTT 132 0 

GCTGCAGGTG CCTTAGTATT GAAAGAT G G A GTGACTGTAG T T G C AAAT AC TAT AAC G C AG 138 0 

GTCGAGGGAT CGAAAGTCGT TAT G GAT G G A GGGACTACTT TTGAGGCAAG CGCTGAGGGG 14 4 0 

GTCACTCTCA ATGGCCTAGC CATTAATATA GATTCCTTAG AT G G GAC AAA TAAAGCTATC 1500 

ATTAAGGCGA CGGCAGCAAG TAAGGATGTT GCCTTATCAG GGCCTAT CAT GCTTGTAGAT 1560 

GCTCAGGGGA ACT AT TAT GA G CAT CAT AAT CTCAGTCAAC AGCAGGTCTT TCCTTTAATA 162 0 

GAGCTTTCTG CACAAGGAAC GATGACTACT ACAGAT AT C C CCGATACCCC AAT TCT AAAT 168 0 

AC T AC GAAT C ACTATGGGTA TCAAGGAACT GGAATAATTG TTTGGGTCGA CGATGCAACT 17 4 0 

GCAAAAACAA AAAATGCTAC CTTAACTTGG ACTAAAACAG GATACAAGCC GAATCCAGAA 18 00 

CGTCAGGGAC CTTTGGTTCC T AAT AG C C T G TGGGGTTCTT TTGTCGATGT CCGCTCCATT 18 60 
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CAGAGCCTCA TGGACCGGAG CACAAGTTCG TTATCTTCGT CAACAAATTT GTGGGTATCA 192 0 

GGAATCGCGG ACTTTTTGCA T GAAGAT C AG AAAGGAAACC AACGTAGTTA TCGTCATTCT 198 0 

AGCGCGGGTT AT GCAT TAG G AG GAG GAT T C TTCACGGCTT CTGAAAATTT CTTTAATTTT 2 04 0 

GCTTTTTGTC AGCTTTTTGG CTACGACAAG GACCATCTTG TGGCTAAGAA CCATACCCAT 210 0 

GTATAT GCAG GGGCAATGAG TTACCGACAC CTCGGAGAGT CTAAGACCCT CGCTAAGATT 2160 

TTGTCAGGAA ATTCTGACTC CCTACCTTTT GTCTTCAATG CTCGGTTTGC TTATGGCCAT 222 0 

ACCGACAATA ACATGACCAC AAAGTACACT GGCTATTCTC CTGTTAAGGG AAGCTGGGGA 22 8 0 

AATGATGCCT TCGGTATAGA AT GT G GAG GA GCTATCCCGG TAGTTGCTTC AGGACGTCGG 234 0 

TCTTGGGTGG ATACCCACAC GCCATTTCTA AACCTAGAGA TGATCTATGC ACAT CAGAAT 24 0 0 

GACTTTAAGG AAAACGGCAC AGAAGGCCGT TCTTTCCAAA GTGAAGACCT CTTCAATCTA 2 4 60 

GCGGTTCCTG TAG G GAT AAA AT T T GAG AAA TTCTCCGATA AGT CTACGTA TGATCTCTCC 2 52 0 

AT AG C T T AC G TTCCCGATGT GATTCGTAAT GATCCAGGCT GCAC GACAAC TCTTATGGTT 258 0 

TCTGGGGATT CTTGGTCGAC ATGTGGTACA AGCTTGTCTA GACAAGCTCT T CT T GT AC GT 2 64 0 

GCTGGAAATC AT CATGCCTT TGCTTCAAAC TTTGAAGTTT TCAGTCAGTT T GAAGT C GAG 270 0 

TTGCGAGGTT CTTCTCGTAG CTATGCTATC GAT CT T G GAG GAAGAT T C G G ATTTTAA 2 7 57 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

Met Arg Ser Ser Phe Ser Leu Leu Leu lie Ser Ser Ser Leu Ala Phe 
15 10 15 

Pro Leu Leu Met Ser Val Ser Ala Asp Ala Ala Asp Leu Thr Leu Gly 
20 25 30 

Ser Arg Asp Ser Tyr Asn Gly Asp Thr Ser Thr Thr Glu Phe Thr Pro 
35 40 45 

Lys Ala Ala Thr Ser Asp Ala Ser Gly Thr Thr Tyr lie Leu Asp Gly 
50 55 60 

Asp Val Ser lie Ser Gin Ala Gly Lys Gin Thr Ser Leu Thr Thr Ser 
65 70 75 80 

Cys Phe Ser Asn Thr Ala Gly Asn Leu Thr Phe Leu Gly Asn Gly Phe 
85 90 95 

Ser Leu His Phe Asp Asn lie lie Ser Ser Thr Val Ala Gly Val Val 
100 105 110 

Val Ser Asn Thr Ala Ala Ser Gly lie Thr Lys Phe Ser Gly Phe Ser 
115 120 125 

Thr Leu Arg Met Leu Ala Ala Pro Arg Thr Thr Gly Lys Gly Ala lie 
130 135 140 

Lys lie Thr Asp Gly Leu Val Phe Glu Ser lie Gly Asn Leu Asp Gin 
145 150 155 160 
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Asn Glu Asn Ala Ser Ser Glu Asn Gly Gly Ala lie Asn Thr Lys Thr 
165 170 175 

Leu Ser Leu Thr Gly Ser Thr Arg Phe Val Ala Phe Leu Gly Asn Ser 
180 185 190 

Ser Ser Gin Gin Gly Gly Ala lie Tyr Ala Ser Gly Asp Ser Val lie 
195 200 205 

Ser Glu Asn Ala Gly lie Leu Ser Phe Gly Asn Asn Ser Ala Thr Thr 
210 215 220 

Ser Gly Gly Ala lie Ser Ala Glu Gly Asn Leu Val He Ser Asn Asn 
225 230 235 240 

Gin Asn He Phe Phe Asp Gly Cys Lys Ala Thr Thr Asn Gly Gly Ala 
245 250 255 

He Asp Cys Asn Lys Ala Gly Ala Asn Pro Asp Pro He Leu Thr Leu 
260 265 270 

Ser Gly Asn Glu Ser Leu His Phe Leu Asn Asn Thr Ala Gly Asn Ser 
275 280 285 

Gly Gly Ala He Tyr Thr Lys Lys Leu Val Leu Ser Ser Gly Arg Gly 
290 295 300 

Gly Val Leu Phe Ser Asn Asn Lys Ala Ala Asn Ala Thr Pro Lys Gly 
305 310 315 320 

Gly Ala He Ala He Leu Asp Ser Gly Glu He Ser He Ser Ala Asp 
325 330 335 

Leu Gly Asn He He Phe Glu Gly Asn Thr Thr Ser Thr Thr Gly Ser 
340 345 350 

Pro Ala Ser Val Thr Arg Asn Ala He Asp Leu Ala Ser Asn Ala Lys 
355 360 365 

Phe Leu Asn Leu Arg Ala Thr Arg Gly Asn Lys Val He Phe Tyr Asp 
370 375 380 

Pro He Thr Ser Ser Gly Ala Thr Asp Lys Leu Ser Leu Asn Lys Ala 
385 390 395 400 

Asp Ala Gly Ser Gly Asn Thr Tyr Glu Gly Tyr He Val Phe Ser Gly 
405 410 415 

Glu Lys Leu Ser Glu Glu Glu Leu Lys Lys Pro Asp Asn Leu Lys Ser 
420 425 430 

Thr Phe Thr Gin Ala Val Glu Leu Ala Ala Gly Ala Leu Val Leu Lys 
435 440 445 

Asp Gly Val Thr Val Val Ala Asn Thr He Thr Gin Val Glu Gly Ser 
450 455 460 



Lys Val Val Met Asp Gly Gly Thr Thr Phe Glu Ala Ser Ala Glu Gly 
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465 470 475 480 

Val Thr Leu Asn Gly Leu Ala lie Asn lie Asp Ser Leu Asp Gly Thr 
485 490 495 

Asn Lys Ala lie lie Lys Ala Thr Ala Ala Ser Lys Asp Val Ala Leu 
500 505 510 

Ser Gly Pro lie Met Leu Val Asp Ala Gin Gly Asn Tyr Tyr Glu His 
515 520 525 

His Asn Leu Ser Gin Gin Gin Val Phe Pro Leu lie Glu Leu Ser Ala 
530 535 540 

Gin Gly Thr Met Thr Thr Thr Asp lie Pro Asp Thr Pro lie Leu Asn 
545 550 555 560 

Thr Thr Asn His Tyr Gly Tyr Gin Gly Thr Gly lie lie Val Trp Val 
565 570 575 

Asp Asp Ala Thr Ala Lys Thr Lys Asn Ala Thr Leu Thr Trp Thr Lys 
580 585 590 

Thr Gly Tyr Lys Pro Asn Pro Glu Arg Gin Gly Pro Leu Val Pro Asn 
595 600 605 

Ser Leu Trp Gly Ser Phe Val Asp Val Arg Ser lie Gin Ser Leu Met 
610 615 620 

Asp Arg Ser Thr Ser Ser Leu Ser Ser Ser Thr Asn Leu Trp Val Ser 
625 630 635 640 

Gly lie Ala Asp Phe Leu His Glu Asp Gin Lys Gly Asn Gin Arg Ser 
645 650 655 

Tyr Arg His Ser Ser Ala Gly Tyr Ala Leu Gly Gly Gly Phe Phe Thr 
660 665 670 

Ala Ser Glu Asn Phe Phe Asn Phe Ala Phe Cys Gin Leu Phe Gly Tyr 
675 680 685 

Asp Lys Asp His Leu Val Ala Lys Asn His Thr His Val Tyr Ala Gly 
690 695 700 

Ala Met Ser Tyr Arg His Leu Gly Glu Ser Lys Thr Leu Ala Lys lie 
705 710 715 720 

Leu Ser Gly Asn Ser Asp Ser Leu Pro Phe Val Phe Asn Ala Arg Phe 
725 730 735 

Ala Tyr Gly His Thr Asp Asn Asn Met Thr Thr Lys Tyr Thr Gly Tyr 
740 745 750 

Ser Pro Val Lys Gly Ser Trp Gly Asn Asp Ala Phe Gly lie Glu Cys 
755 760 765 

Gly Gly Ala lie Pro Val Val Ala Ser Gly Arg Arg Ser Trp Val Asp 
770 775 780 
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Thr His Thr Pro Phe Leu Asn Leu Glu Met lie Tyr Ala His Gin Asn 
785 790 795 800 

Asp Phe Lys Glu Asn Gly Thr Glu Gly Arg Ser Phe Gin Ser Glu Asp 
805 810 815 

Leu Phe Asn Leu Ala Val Pro Val Gly lie Lys Phe Glu Lys Phe Ser 
820 825 830 

Asp Lys Ser Thr Tyr Asp Leu Ser lie Ala Tyr Val Pro Asp Val lie 
835 840 845 

Arg Asn Asp Pro Gly Cys Thr Thr Thr Leu Met Val Ser Gly Asp Ser 
850 855 860 

Trp Ser Thr Cys Gly Thr Ser Leu Ser Arg Gin Ala Leu Leu Val Arg 
865 870 875 880 

Ala Gly Asn His His Ala Phe Ala Ser Asn Phe Glu Val Phe Ser Gin 
885 890 895 

Phe Glu Val Glu Leu Arg Gly Ser Ser Arg Ser Tyr Ala lie Asp Leu 
900 905 910 

Gly Gly Arg Phe Gly Phe 
915 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

ATGAAATCCT CTCTTCATTG GTTTGTAATC TCGTCATCTT TAGCACTTCC CTTGTCACTA 60 

AATTTCTCTG CGTTTGCTGC TGTTGTTGAA AT C AAT C TAG GACCTACCAA TAGCTTCTCT 12 0 

GGACCAGGAA CCTACACTCC TCCAGCCCAA ACAACAAATG CAGATGGAAC TAT CTATAAT 18 0 

CTAACAGGGG ATGTCTCAAT C AC C AAT G C A GGATCTCCGA CAGCTCTAAC CGCTTCCTGC 24 0 

TTTAAAGAAA CTACTGGGAA TCTTTCTTTC CAAGGCCACG G C T AC C AAT T TCTCCTACAA 30 0 

AAT AT C GAT G CGGGAGCGAA CTGTACCTTT AC CAATACAG CTGCAAATAA GCTTCTCTCC 3 60 

TTTT CAGGAT TCTCCTATTT GTCACTAATA CAAACCACGA AT G C T AC C AC AG GAAC AGGA 42 0 

G C CAT CAAGT CCACAGGAGC TTGTTCTATT CAGTCGAACT ATAGTTGCTA CTTTGGCCAA 4 80 

AACTTTTCTA ATGACAATGG AGGCGCCCTC CAAGGCAGCT CTATCAGTCT ATCGCTAAAC 54 0 

CCCAACCTAA CGTTTGCCAA AAACAAAGCA AC G C AAAAAG GGGGTGCCCT CTATTCCACG 600 

G GAG G GAT T A CAATTAACAA TACGTTAAAC T CAGCAT CAT TTTCTGAAAA TACCGCGGCG 66 0 

AACAATGGCG GAG C CAT T T A CACGGAAGCT AGCAGTTTTA T TAG C AG C AA CAAAGCAATT 72 0 

AGCTTTATAA ACAATAGTGT GACCGCAACC TCAGCTACAG G GG GAG C CAT TTACTGTAGT 78 0 

AGTACATCAG CCCCCAAACC AGTCTTAACT CTAT CAGACA ACGGGGAACT GAAC T T TATA 84 0 

G G AAAT AC AG C AAT T AC TAG TGGTGGGGCG ATTTATACTG ACAAT CTAGT TCTTTCTTCT 9 00 

GGAGGACCTA CGCTTTTTAA AAACAACTCT GCTATAGATA CTGCAGCTCC C T TAG GAG GA 9 60 

GCAATTGCGA TTGCTGACTC TGGATCTTTG AGTCTTTCGG CTCTTGGTGG AGACATCACT 102 0 

TTTGAAGGAA ACACAGTAGT CAAAGGAGCT TCTTCGAGTC AGACCACTAC CAGAAATTCT 108 0 
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ATTAACATCG 
ATCTACTTCT 
TTAAATGGTC 
GAGAAGCTCT 
CCTCTAACTC 
TCCTTTTCGC 
GCTGATGGGA 
AAGGCTACGC 
CTTGTAGATC 
TCTTGTCTCA 
GATCCCCTAG 
CAAGAG GAT A 
AATCCGAATC 
GATGTGCGCT 
GGCATCTGGT 
GGTTTTCGCC 
AATCTTATCA 
AAAAATAGAG 
TCTCCAAGCT 
GCTCAGATCA 
AAGGGAGAGA 
CACACTGCTT 
GCTTCGTACA 
GATAGCGGTG 
AGAAAC GAGC 
AATCCTGACT 
AATCTCTCAA 
CTTGAGGTCA 
GATCTTGGAG 



GAAACAC CAA 
AT GAT C C TAT 
CTGACCTTGC 
CGGAAGCAGA 
TTGCGGGAGG 
AATCTCCGGG 
T CACTAT CAA 
TAAAAGCAAC 
CTTCTGGAAA 
CTCTTACTGC 
AAAAAAATCC 
CTGCGACTAA 
CTGAGCGTCG 
C C AT AC AAC A 
GT GAAGGGAT 
ACATAAGT GC 
CTGCAGCCTT 
CTTCTGCCTA 
TGTTACGCTA 
GCTATATCTA 
GCTCGTGGTA 
TAAGCCATGA 
TACACCAAGA 
AT T T AAT T AA 
GTGCGTCTTA 
GCACGACAGC 
GACAAGCTGG 
CAAGTAACCT 
GTAAGTTCCA 



TGCTAAGATT 
AACAACTAAC 
AGGGAATCCT 
AGCTGCAGAA 
GCAACTCTCT 
CTCTACCCTC 
TAATCTTGTT 
ACAAGCAAGT 
TGTCTACGAA 
TGACGACCCC 
TATCCATTGG 
ATCCAAAGCA 
TGGAACCTTA 
GCTTGTAGCC 
CTCGAACTTC 
AGGTTATGTT 
CTGCCAATTA 
TGCAGCTTCT 
CCTTCCTGGA 
TAG T AAAAAT 
TAATGACGGT 
GGGTCTCTTC 
TAGCTTCAAA 
CGTCTCTGTG 
CGAAGCTACT 
TCTCCTAATC 
TATCGGAAGA 
ATCTATGGAA 
GTTCTAA 



GTACAGCTGC 
CAT AC T G C AG 
G CAT AT C AAG 
G CT GAT AAT C 
CTTAAATCAG 
CTCATGGATG 
CTCAATGTAG 
CAGACAGT CA 
GATGTCTCTT 
GCGAATATTC 
G GAT AC C AAG 
GCGACTCTTA 
GTTGCTAACA 
ACTAAAGTAC 
TTCCATAAAG 
GTAGGAGCGA 
TTCGGGAAAG 
CTCCATCTCC 
TCTGAAAGTG 
ACTAT GAAAA 
TGCGCTCTGG 
CACGCGTATT 
GAAC GTAATA 
CCTATTGGAA 
GTCATCTACG 
AACAATAC CT 
G CAG G GAT CT 
ATTCGTGGAT 



GAGCCTCTCA 
CTCTCTCAGA 
GAAC CAT C GT 
TCAAATCTAC 
GAGTCACTCT 
CAGGGACCAC 
ATTCCTTAAA 
CTTTATCTGG 
GGAATAACCC 
ACATCACAGA 
GGAATTGGGC 
CCTGGACAAA 
CGCTATGGGG 
GCCAATCTCA 
AT AG C AC G AA 
C T AC AAC AT T 
AT AG AGAT CA 
AGCATCTAGC 
AGCAGCCTGT 
CCTATTACAC 
AACTTGCGAG 
TTCCTTTCAT 
CTACCTTGGT 
TTACCTTCGA 
TTGCCGATGT 
CGTGGAAAAC 
TTTATGCCTT 
CTTCACGCAG 



AGGCAATACT 
TGCTCTAAAC 
ATTTTCTGGA 
AATTCAGCAA 
AGTTGCTAAG 
ATT AGAAAC C 
AGAGAC CAAG 
ATCGCTCTCT 
TCAAGTCTTT 
CTTAGCTGCT 
ATTATCTTGG 
AAC AG GAT AC 
ATCCTTTGTT 
AGAAACTCGC 
GATAAATAAA 
AGCTTCTGAT 
CTTTATAAAT 
GACCTTGTCT 
CCTCTTTGAT 
CCAAGCACCA 
CTCCCTACCA 
CAAAGTAGAA 
ACGATCTTTC 
GAGATTCTCG 
CT AT C GTAAG 
T AC AG GAAC G 
CTCTCCAAAT 
C T AC AAT G C A 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2787 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 928 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Met Lys Ser Ser Leu His Trp Phe Val lie Ser Ser Ser Leu Ala Leu 



10 



15 



Pro Leu Ser Leu Asn Phe Ser Ala Phe Ala Ala Val Val Glu lie Asn 



20 



25 



30 



Leu Gly Pro Thr Asn Ser Phe Ser Gly Pro Gly Thr Tyr Thr Pro Pro 



35 



40 



45 



Ala Gin Thr Thr Asn Ala Asp Gly Thr lie Tyr Asn Leu Thr Gly Asp 



50 



55 



60 



Val Ser lie Thr Asn Ala Gly Ser Pro Thr Ala Leu Thr Ala Ser Cys 



65 



70 



75 



80 



Phe Lys Glu Thr Thr Gly Asn Leu Ser Phe Gin Gly His Gly Tyr Gin 
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85 90 95 

Phe Leu Leu Gin Asn lie Asp Ala Gly Ala Asn Cys Thr Phe Thr Asn 
100 105 110 

Thr Ala Ala Asn Lys Leu Leu Ser Phe Ser Gly Phe Ser Tyr Leu Ser 
115 120 125 

Leu lie Gin Thr Thr Asn Ala Thr Thr Gly Thr Gly Ala lie Lys Ser 
130 135 140 

Thr Gly Ala Cys Ser lie Gin Ser Asn Tyr Ser Cys Tyr Phe Gly Gin 
145 150 155 160 

Asn Phe Ser Asn Asp Asn Gly Gly Ala Leu Gin Gly Ser Ser lie Ser 
165 170 175 

Leu Ser Leu Asn Pro Asn Leu Thr Phe Ala Lys Asn Lys Ala Thr Gin 
180 185 190 

Lys Gly Gly Ala Leu Tyr Ser Thr Gly Gly lie Thr lie Asn Asn Thr 

195 200 205 

Leu Asn Ser Ala Ser Phe Ser Glu Asn Thr Ala Ala Asn Asn Gly Gly 
210 215 220 

Ala lie Tyr Thr Glu Ala Ser Ser Phe lie Ser Ser Asn Lys Ala lie 
225 230 235 240 

Ser Phe lie Asn Asn Ser Val Thr Ala Thr Ser Ala Thr Gly Gly Ala 
245 250 255 

lie Tyr Cys Ser Ser Thr Ser Ala Pro Lys Pro Val Leu Thr Leu Ser 
260 265 270 

Asp Asn Gly Glu Leu Asn Phe lie Gly Asn Thr Ala lie Thr Ser Gly 
275 280 285 

Gly Ala lie Tyr Thr Asp Asn Leu Val Leu Ser Ser Gly Gly Pro Thr 
290 295 300 

Leu Phe Lys Asn Asn Ser Ala lie Asp Thr Ala Ala Pro Leu Gly Gly 
305 310 315 320 

Ala lie Ala lie Ala Asp Ser Gly Ser Leu Ser Leu Ser Ala Leu Gly 
325 330 335 

Gly Asp lie Thr Phe Glu Gly Asn Thr Val Val Lys Gly Ala Ser Ser 
340 345 350 

Ser Gin Thr Thr Thr Arg Asn Ser lie Asn lie Gly Asn Thr Asn Ala 
355 360 365 

Lys lie Val Gin Leu Arg Ala Ser Gin Gly Asn Thr He Tyr Phe Tyr 
370 375 380 



Asp Pro He Thr Thr Asn His Thr Ala Ala Leu Ser Asp Ala Leu Asn 
385 390 395 400 
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Leu Asn 

Val Phe 

Asn Leu 

Leu Ser 
450 

Ser Pro 
465 

Ala Asp 

Lys Glu 

Val Thr 

Tyr Glu 
530 

Leu Thr 
545 

Asp Pro 

Ala Leu 

Leu Thr 

Thr Leu 
610 

lie Gin 
625 

Gly lie 

Lys lie 

Ala Thr 

Gin Leu 
690 



Gly Pro 

Ser Gly 
420 

Lys Ser 
435 

Leu Lys 

Gly Ser 

Gly lie 

Thr Lys 
500 

Leu Ser 
515 

Asp Val 

Ala Asp 

Leu Glu 

Ser Trp 
580 

Trp Thr 
595 

Val Ala 

Gin Leu 

Trp Cys 

Asn Lys 
660 

Thr Thr 
675 

Phe Gly 



Asp Leu 
405 

Glu Lys 

Thr lie 

Ser Gly 

Thr Leu 
470 

Thr lie 
485 

Lys Ala 

Gly Ser 

Ser Trp 

Asp Pro 
550 

Lys Asn 
565 

Gin Glu 

Lys Thr 

Asn Thr 

Val Ala 
630 

Glu Gly 
645 

Gly Phe 
Leu Ala 
Lys Asp 



Ala Gly 

Leu Ser 

Gin Gin 
440 

Val Thr 
455 

Leu Met 
Asn Asn 



Thr Leu 



Leu Ser 
520 

Asn Asn 
535 

Ala Asn 

Pro lie 

Asp Thr 

Gly Tyr 
600 

Leu Trp 
615 

Thr Lys 

lie Ser 

Arg His 

Ser Asp 
680 

Arg Asp 
695 



Asn Pro 
410 

Glu Ala 
425 

Pro Leu 



Leu Val 



Asp Ala 



Leu Val 
490 

Lys Ala 
505 

Leu Val 

Pro Gin 

lie His 

His Trp 
570 

Ala Thr 
585 

Asn Pro 

Gly Ser 

Val Arg 

Asn Phe 
650 

lie Ser 
665 

Asn Leu 



His Phe 



Ala Tyr 

Glu Ala 

Thr Leu 

Ala Lys 
460 

Gly Thr 
475 

Leu Asn 

Thr Gin 

Asp Pro 

Val Phe 
540 

lie Thr 
555 

Gly Tyr 

Lys Ser 

Asn Pro 

Phe Val 
620 

Gin Ser 
635 

Phe His 

Ala Gly 

lie Thr 

lie Asn 
700 



Gin Gly Thr lie 
415 

Ala Glu Ala Asp 
430 

Ala Gly Gly Gin 
445 

Ser Phe Ser Gin 



Thr Leu Glu Thr 
480 

Val Asp Ser Leu 
495 

Ala Ser Gin Thr 
510 

Ser Gly Asn Val 
525 

Ser Cys Leu Thr 



Asp Leu Ala Ala 
560 

Gin Gly Asn Trp 
575 

Lys Ala Ala Thr 
590 

Glu Arg Arg Gly 
605 

Asp Val Arg Ser 



Gin Glu Thr Arg 
640 

Lys Asp Ser Thr 
655 

Tyr Val Val Gly 
670 

Ala Ala Phe Cys 
685 

Lys Asn Arg Ala 
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Ser Ala Tyr Ala Ala Ser Leu His Leu Gin His Leu Ala Thr Leu Ser 
705 710 715 720 

Ser Pro Ser Leu Leu Arg Tyr Leu Pro Gly Ser Glu Ser Glu Gin Pro 
725 730 735 

Val Leu Phe Asp Ala Gin lie Ser Tyr lie Tyr Ser Lys Asn Thr Met 

740 745 750 

Lys Thr Tyr Tyr Thr Gin Ala Pro Lys Gly Glu Ser Ser Trp Tyr Asn 

755 760 765 

Asp Gly Cys Ala Leu Glu Leu Ala Ser Ser Leu Pro His Thr Ala Leu 
770 775 780 

Ser His Glu Gly Leu Phe His Ala Tyr Phe Pro Phe lie Lys Val Glu 
785 790 795 800 

Ala Ser Tyr lie His Gin Asp Ser Phe Lys Glu Arg Asn Thr Thr Leu 
805 810 815 

Val Arg Ser Phe Asp Ser Gly Asp Leu lie Asn Val Ser Val Pro lie 
820 825 830 

Gly lie Thr Phe Glu Arg Phe Ser Arg Asn Glu Arg Ala Ser Tyr Glu 
835 840 845 

Ala Thr Val lie Tyr Val Ala Asp Val Tyr Arg Lys Asn Pro Asp Cys 
850 855 860 

Thr Thr Ala Leu Leu lie Asn Asn Thr Ser Trp Lys Thr Thr Gly Thr 
865 870 875 880 

Asn Leu Ser Arg Gin Ala Gly lie Gly Arg Ala Gly lie Phe Tyr Ala 
885 890 895 

Phe Ser Pro Asn Leu Glu Val Thr Ser Asn Leu Ser Met Glu lie Arg 
900 905 910 

Gly Ser Ser Arg Ser Tyr Asn Ala Asp Leu Gly Gly Lys Phe Gin Phe 
915 920 925 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2793 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

AT G AAAAT AC CCTTGCACAA ACTCCTGATC TCTTCGACTC TTGTCACTCC CATTCTATTG 60 

AG CAT T G C AA CTTACGGAGC AGATGCTTCT TTATCCCCTA CAGATAGCTT TGATGGAGCG 12 0 

GGCGGCTCTA CATTTACTCC AAAAT C T AC A GCAGATGCCA ATGGAACGAA CTATGTCTTA 180 

TCAGGAAATG T C TAT AT AAA CGATGCTGGG AAAG G C AC AG CAT T AAC AG G CTGCTGCTTT 24 0 

ACAGAAACTA CGGGTGATCT G AC AT T T AC T GGAAAGGGAT ACTCATTTTC AT T C AAC AC G 3 00 
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GTAGATGCGG GTTCGAATGC AGGAGCTGCG GCAAGCACAA CTGCTGATAA AGCCCTAACA 3 60 

TTCACAGGAT TTTCTAACCT TTCCTTCATT GCAGCTCCTG GAACTACAGT TGCTTCAGGA 42 0 

AAAAGTACTT TAAGTTCTGC AGGAGCCTTA AATCTTACCG ATAATGGAAC GATTCTCTTT 48 0 

AGCCAAAACG TCTCCAATGA AGCTAATAAC AATGGCGGAG CGATCACCAC AAAAACTCTT 54 0 

TCTATTTCTG GGAATACCTC TTCTATAACC TTCACTAGTA ATAGCGCAAA AAAATTAGGT 60 0 

GGAGCGATCT ATAGCTCTGC GGCTGCAAGT AT T T C AG GAA ACACCGGCCA GTTAGTCTTT 660 

ATGAATAATA AAGGAGAAAC TGGGGGCGGG GCTCTGGGCT TTGAAGCCAG CTCCTCGATT 72 0 

ACT CAAAATA GCTCCCTTTT CTTCTCTGGA AACACT GCAA CAGATGCTGC AGGCAAGGGC 780 

GGGGCCATTT ATTGTGAAAA AACAGGAGAG ACTCCTACTC TTACTATCTC T G GAAAT AAA 84 0 

AGTCTGACCT TCGCCGAGAA CTCTTCAGTA ACTCAAGGCG GAGCAATCTG TGCCCATGGT 900 

CTAGATCTTT CCGCTGCTGG CCCTACCCTA TTTTCAAATA AT A GAT G C G G GAACACAGCT 960 

GCAGGCAAGG GCGGCGCTAT TGCAATTGCC GACTCTGGAT CTTTAAGTCT CTCTGCAAAT 102 0 

CAAGGAGACA TCACGTTCCT TGGCAACACT CTAACCTCAA CCTCCGCGCC AACATCGACA 108 0 

CGGAATGCTA TCTACCTGGG ATCGTCAGCA AAAATTACGA ACTTAAGGGC AGCCCAAGGC 114 0 

CAATCTATCT ATTTCTATGA TCCGATTGCA TCTAACACCA CAGGAGCTTC AGACGTTCTG 12 0 0 

ACCATCAACC AAC C G GAT AG CAACTCGCCT TTAGATTATT CAGGAACGAT TGTATTTTCT 1260 

GGGGAAAAGC TCTCTGCAGA TGAAGCGAAA GCTGCTGATA ACTTCACATC TAT AT T AAAG 132 0 

C AAC CAT T G G CTCTAGCCTC TGGAACCTTA G CAC T C AAAG GAAAT GT C GA GTTAGATGTC 138 0 

AATGGTTTCA CACAGACTGA AGGCTCTACA CTCCTCATGC AACCAGGAAC AAAGCTCAAA 144 0 

GC AG AT ACT G AAGCTATCAG TCTTACCAAA CTTGTCGTTG ATCTTTCTGC CTTAGAGGGA 1500 

AATAAGAGTG TGTCCATTGA AAC AG C AG GA GCCAACAAAA CTATAACTCT AACCTCTCCT 1560 

CTTGTTTTCC AAG AT AG TAG CGGCAATTTT TAT G AAAG C C AT AC GAT AAA CCAAGCCTTC 162 0 

ACGCAGCCTT TGGTGGTATT CACTGCTGCT ACTGCTGCTA G C GAT AT T T A TAT C GAT GC G 168 0 

CTTCTCACTT CTCCAGTACA AACTCCAGAA CCTCATTACG GGTATCAGGG ACAT T GGGAA 174 0 

GCCACTTGGG C AGAC AC AT C AACTGCAAAA TCAGGAACTA TGACTTGGGT AACTACGGGC 1800 

TACAACCCTA ATCCTGAGCG TAGAGCTTCC GTAGTTCCCG AT T CAT TAT G GGCATCCTTT 18 60 

ACTGACATTC GCACTCTACA G C AGAT CAT G ACATCTCAAG CGAATAGTAT C TAT C AG C AA 192 0 

CGAGGACTCT GGGCATCAGG AACTGCGAAT TTCTTCCATA AGGATAAATC AGGAACTAAC 198 0 

C AAG CAT T C C GACATAAAAG CTACGGCTAT ATTGTTGGAG GAAGT GCT GA AGATTTTTCT 2 04 0 

GAAAATATCT TCAGTGTAGC TTTCTGCCAG CTCTTCGGTA AAGATAAAGA CCTGTTTATA 2100 

GT T GAAAAT A CCTCTCATAA CTATTTAGCG TCGCTATACC T GCAA CAT CG AGCATTCCTA 2160 

GGAGGACTTC CCATGCCCTC AT T T G GAAGT ATCACCGACA TGCTGAAAGA TATTCCTCTC 222 0 

ATTTTGAATG C C C AG C T AAG CTACAGCTAC AC T AAAAAT G AT AT G GAT AC TCGCTATACT 22 8 0 

TCCTATCCTG AAGCT CAAGG TTCTTGGACC AATAATTCTG GGGCTCTAGA GCTCGGAGGA 2 34 0 

TCTCTGGCTC TATATCTCCC T AAAGAAG C A CCGTTCTTCC AG G GAT AT T T CCCCTTCTTA 2 4 00 

AAGTTCCAGG CAGT CTACAG CCGCCAACAA AACTTTAAAG AGAGTGGCGC TGAAGCCCGT 2 4 60 

GCTTTTGATG ATGGAGACCT AGTGAACTGC TCTATCCCTG TCGGCATTCG GT T AGAAAAA 2 52 0 

ATCTCCGAAG ATGAAAAAAA TAATTTCGAG ATTTCTCTAG C C AAC AT T G G TGATGTGTAT 2580, 

CGTAAAAATC CCCGTTCGCG TACTTCTCTA ATGGTCAGTG GAGCCTCTTG GACTTCGCTA 2 64 0 

TGTAAAAACC TCGCACGACA AGCCTTCTTA GCAAGT GCT G GAAG C CAT C T GACTCTCTCC 270 0 

C CT CAT GT AG AACTCTCTGG GGAAGCTGCT TATGAGCTTC GTGGCTCAGC AC AC AT C T AC 27 60 

AATGTAGATT GTGGGCTAAG AT AC T CAT T C TAG 2 7 93 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Met Lys lie Pro Leu His Lys Leu Leu lie Ser Ser Thr Leu Val Thr 
15 10 15 



70 



Pro lie Leu Leu 
20 

Pro Thr Asp Ser 
35 

Ser Thr Ala Asp 
50 

Tyr lie Asn Asp 
65 

Thr Glu Thr Thr 



Ser Phe Asn Thr 
100 

Thr Thr Ala Asp 
115 

Phe lie Ala Ala 
130 

Ser Ser Ala Gly 
145 

Ser Gin Asn Val 



Thr Lys Thr Leu 
180 

Ser Asn Ser Ala 
195 

Ala Ser lie Ser 
210 

Gly Glu Thr Gly 
225 

Thr Gin Asn Ser 



Ala Gly Lys Gly 
260 

Thr Leu Thr lie 
275 

Ser Val Thr Gin 
290 

Ala Ala Gly Pro 
305 

Ala Gly Lys Gly 



Ser lie Ala Thr 



Phe Asp Gly Ala 
40 

Ala Asn Gly Thr 
55 

Ala Gly Lys Gly 
70 

Gly Asp Leu Thr 
85 

Val Asp Ala Gly 



Lys Ala Leu Thr 
120 

Pro Gly Thr Thr 
135 

Ala Leu Asn Leu 
150 

Ser Asn Glu Ala 
165 

Ser lie Ser Gly 



Lys Lys Leu Gly 
200 

Gly Asn Thr Gly 
215 

Gly Gly Ala Leu 
230 

Ser Leu Phe Phe 
245 

Gly Ala lie Tyr 



Ser Gly Asn Lys 
280 

Gly Gly Ala lie 
295 

Thr Leu Phe Ser 
310 

Gly Ala lie Ala 
325 



Tyr Gly Ala Asp 
25 

Gly Gly Ser Thr 



Asn Tyr Val Leu 
60 

Thr Ala Leu Thr 
75 

Phe Thr Gly Lys 
90 

Ser Asn Ala Gly 
105 

Phe Thr Gly Phe 



Val Ala Ser Gly 

140 

Thr Asp Asn Gly 
155 

Asn Asn Asn Gly 
170 

Asn Thr Ser Ser 
185 

Gly Ala lie Tyr 



Gin Leu Val Phe 
220 

Gly Phe Glu Ala 
235 

Ser Gly Asn Thr 
250 

Cys Glu Lys Thr 
265 

Ser Leu Thr Phe 



Cys Ala His Gly 
300 

Asn Asn Arg Cys 
315 

lie Ala Asp Ser 
330 



Ala Ser Leu Ser 
30 

Phe Thr Pro Lys 
45 

Ser Gly Asn Val 

Gly Cys Cys Phe 
80 

Gly Tyr Ser Phe 
95 

Ala Ala Ala Ser 
110 

Ser Asn Leu Ser 
125 

Lys Ser Thr Leu 



Thr lie Leu Phe 
160 

Gly Ala lie Thr 
175 

lie Thr Phe Thr 
190 

Ser Ser Ala Ala 
205 

Met Asn Asn Lys 



Ser Ser Ser lie 
240 

Ala Thr Asp Ala 
255 

Gly Glu Thr Pro 
270 

Ala Glu Asn Ser 
285 

Leu Asp Leu Ser 



Gly Asn Thr Ala 
320 

Gly Ser Leu Ser 
335 
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Leu Ser Ala Asn Gin Gly Asp lie Thr Phe Leu Gly Asn Thr Leu Thr 
340 345 350 

Ser Thr Ser Ala Pro Thr Ser Thr Arg Asn Ala lie Tyr Leu Gly Ser 
355 360 365 

Ser Ala Lys lie Thr Asn Leu Arg Ala Ala Gin Gly Gin Ser lie Tyr 
370 375 380 

Phe Tyr Asp Pro lie Ala Ser Asn Thr Thr Gly Ala Ser Asp Val Leu 
385 390 395 400 

Thr lie Asn Gin Pro Asp Ser Asn Ser Pro Leu Asp Tyr Ser Gly Thr 
405 410 415 

lie Val Phe Ser Gly Glu Lys Leu Ser Ala Asp Glu Ala Lys Ala Ala 
420 425 430 

Asp Asn Phe Thr Ser lie Leu Lys Gin Pro Leu Ala Leu Ala Ser Gly 
435 440 445 

Thr Leu Ala Leu Lys Gly Asn Val Glu Leu Asp Val Asn Gly Phe Thr 
450 455 460 

Gin Thr Glu Gly Ser Thr Leu Leu Met Gin Pro Gly Thr Lys Leu Lys 
465 470 475 480 

Ala Asp Thr Glu Ala lie Ser Leu Thr Lys Leu Val Val Asp Leu Ser 
485 490 495 

Ala Leu Glu Gly Asn Lys Ser Val Ser lie Glu Thr Ala Gly Ala Asn 
500 505 510 

Lys Thr lie Thr Leu Thr Ser Pro Leu Val Phe Gin Asp Ser Ser Gly 
515 520 525 

Asn Phe Tyr Glu Ser His Thr lie Asn Gin Ala Phe Thr Gin Pro Leu 
530 535 540 

Val Val Phe Thr Ala Ala Thr Ala Ala Ser Asp lie Tyr lie Asp Ala 
545 550 555 560 

Leu Leu Thr Ser Pro Val Gin Thr Pro Glu Pro His Tyr Gly Tyr Gin 

565 570 575 

Gly His Trp Glu Ala Thr Trp Ala Asp Thr Ser Thr Ala Lys Ser Gly 
580 585 590 

Thr Met Thr Trp Val Thr Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg 
595 600 605 

Ala Ser Val Val Pro Asp Ser Leu Trp Ala Ser Phe Thr Asp lie Arg 
610 615 620 

Thr Leu Gin Gin lie Met Thr Ser Gin Ala Asn Ser lie Tyr Gin Gin 
625 630 635 640 
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Arg Gly Leu Trp Ala Ser Gly Thr Ala Asn Phe Phe His Lys Asp Lys 
645 650 655 

Ser Gly Thr Asn Gin Ala Phe Arg His Lys Ser Tyr Gly Tyr lie Val 
660 665 670 

Gly Gly Ser Ala Glu Asp Phe Ser Glu Asn lie Phe Ser Val Ala Phe 
675 680 685 

Cys Gin Leu Phe Gly Lys Asp Lys Asp Leu Phe lie Val Glu Asn Thr 
690 695 700 

Ser His Asn Tyr Leu Ala Ser Leu Tyr Leu Gin His Arg Ala Phe Leu 
705 710 715 720 

Gly Gly Leu Pro Met Pro Ser Phe Gly Ser lie Thr Asp Met Leu Lys 
725 730 735 

Asp lie Pro Leu lie Leu Asn Ala Gin Leu Ser Tyr Ser Tyr Thr Lys 
740 745 750 

Asn Asp Met Asp Thr Arg Tyr Thr Ser Tyr Pro Glu Ala Gin Gly Ser 
755 760 765 

Trp Thr Asn Asn Ser Gly Ala Leu Glu Leu Gly Gly Ser Leu Ala Leu 
770 775 780 

Tyr Leu Pro Lys Glu Ala Pro Phe Phe Gin Gly Tyr Phe Pro Phe Leu 
785 790 795 800 

Lys Phe Gin Ala Val Tyr Ser Arg Gin Gin Asn Phe Lys Glu Ser Gly 
805 810 815 

Ala Glu Ala Arg Ala Phe Asp Asp Gly Asp Leu Val Asn Cys Ser lie 
820 825 830 

Pro Val Gly lie Arg Leu Glu Lys lie Ser Glu Asp Glu Lys Asn Asn 
835 840 845 

Phe Glu lie Ser Leu Ala Asn lie Gly Asp Val Tyr Arg Lys Asn Pro 
850 855 860 

Arg Ser Arg Thr Ser Leu Met Val Ser Gly Ala Ser Trp Thr Ser Leu 
865 870 875 880 

Cys Lys Asn Leu Ala Arg Gin Ala Phe Leu Ala Ser Ala Gly Ser His 
885 890 895 

Leu Thr Leu Ser Pro His Val Glu Leu Ser Gly Glu Ala Ala Tyr Glu 
900 905 910 

Leu Arg Gly Ser Ala His lie Tyr Asn Val Asp Cys Gly Leu Arg Tyr 
915 920 925 

Ser Phe 
930 



(2) INFORMATION FOR SEQ ID NO: 17: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GAAGACAATA TAAGGTACCG T C AT AAC AG C GGGGGTTATG CACTAGGGAT CACAGCAACA 6 0 

ACTCCTGCCG AGGAT CAGCT TACTTTTGCC TTCTGCCAGC TCTTTGCTAG AGATCGCAAT 12 0 

CAT AT T AC AG GTAAGAACCA CGGAGATACT TACGGTGCCT CTTTGTATTT CCACCATACA 18 0 

GAAGGGCTCT TCGACATCGC CAATTTCCTC TGGGGAAAAG CAACCCGAGC TCCCTGGGTG 24 0 

CTCTCTGAGA TCTCCCAGAT CATTCCTTTA TCGTTCGATG CTAAATTCAG TTATCTCCAT 30 0 

ACAGACAACC ACAT GAAGAC AT AT TAT AC C GATAACTCTA TCATCAAGGG TTCTTGGAGA 3 60 

AAC GAT G C CT TCTGTGCAGA TCTTGGAGCT AGCCTGCCTT TTGTTATTTC CGTTCCGTAT 42 0 

CTTCTGAAAG AAGTCGAACC TTTTGTCAAA GTACAGTATA TCTATGCGCA TCAGCAAGAC 48 0 

TTCTACGAGC GTCATGCTGA AGGACGCGCT T T C AAT AAAA GCGAGCTTAT C AAC G TAG AG 54 0 

ATTCCTATAG GCGTCACCTT CGAAAGAGAC T CAAAAT C AG AAAAGGGAAC TTACGATCTT 60 0 

ACTCTTATGT ATATACT CGA TGCTTACCGA CGCAATCCTA AATGTCAAAC TTCCCTAATA 660 

GCTAGCGATG CTAACT GGAT GGCCTATGGT ACCAACCTCG CACGACAAGG TTTTTCTGTT 72 0 

CGTGCTGCGA ACCATTTCCA AGTGAACCCC CACATGGAAA TCTTCGGTCA ATTCGCTTTT 78 0 

GAAGTAC GAA GTTCTTCACG AAATTATAAT ACAAACCTAG GCTCTAAGTT TTGTTTCTAG 84 0 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Glu Asp Asn lie Arg Tyr Arg His Asn Ser Gly Gly Tyr Ala Leu Gly 
15 10 15 

lie Thr Ala Thr Thr Pro Ala Glu Asp Gin Leu Thr Phe Ala Phe Cys 

20 25 30 

Gin Leu Phe Ala Arg Asp Arg Asn His lie Thr Gly Lys Asn His Gly 
35 40 45 

Asp Thr Tyr Gly Ala Ser Leu Tyr Phe His His Thr Glu Gly Leu Phe 
50 55 60 

Asp lie Ala Asn Phe Leu Trp Gly Lys Ala Thr Arg Ala Pro Trp Val 
65 70 ' 75 80 

Leu Ser Glu lie Ser Gin lie lie Pro Leu Ser Phe Asp Ala Lys Phe 
85 90 95 



Ser Tyr Leu His Thr Asp Asn His Met Lys Thr Tyr Tyr Thr Asp Asn 
100 105 110 
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Ser lie lie Lys Gly Ser Trp Arg Asn Asp Ala Phe Cys Ala Asp Leu 
115 120 125 

Gly Ala Ser Leu Pro Phe Val lie Ser Val Pro Tyr Leu Leu Lys Glu 
130 135 140 

Val Glu Pro Phe Val Lys Val Gin Tyr lie Tyr Ala His Gin Gin Asp 
145 150 155 160 

Phe Tyr Glu Arg His Ala Glu Gly Arg Ala Phe Asn Lys Ser Glu Leu 
165 170 175 

lie Asn Val Glu lie Pro lie Gly Val Thr Phe Glu Arg Asp Ser Lys 
180 185 190 

Ser Glu Lys Gly Thr Tyr Asp Leu Thr Leu Met Tyr lie Leu Asp Ala 
195 200 205 

Tyr Arg Arg Asn Pro Lys Cys Gin Thr Ser Leu lie Ala Ser Asp Ala 
210 215 220 

Asn Trp Met Ala Tyr Gly Thr Asn Leu Ala Arg Gin Gly Phe Ser Val 
225 230 235 240 

Arg Ala Ala Asn His Phe Gin Val Asn Pro His Met Glu lie Phe Gly 
245 250 255 

Gin Phe Ala Phe Glu Val Arg Ser Ser Ser Arg Asn Tyr Asn Thr Asn 
260 265 270 

Leu Gly Ser Lys Phe Cys Phe 
275 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1545 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION:, SEQ ID NO: 19: 

AT GACCATAC TTCGAAATTT TCTTACCTGC TCGGCTTTAT TCCTCGCTCT CCCTGCAGCA 60 

GCACAAGTTG TAT AT C T T C A T GAAAGT GAT GGTTATAACG GTGCTATCAA TAATAAAAGC 12 0 

TTAGAACCTA AAATTACCTG TTATCCAGAA GGAACTT CTT ACATCTTTCT AGATGACGTG 180 

AGGATTTCCA ACGTTAAGCA TGATCAAGAA GATGCTGGGG TTTTTATAAA TCGATCTGGG 24 0 

AATCTTTTTT TCATGGGCAA CCGTTGCAAC TTCACTTTTC ACAACCTTAT GACCGAGGGT 300 

TTTGGCGCTG CCATTTCGAA CCGCGTTGGA GACACCACTC TCACTCTCTC TAATTTTTCT 3 60 

TACTTAACGT TCACCTCAGC ACCTCTACTA CCTCAAGGAC AAGGAGCGAT TTATAGTCTT 42 0 

GGTTCCGTGA TGATCGAAAA TAGTGAGGAA GTGACTTTCT GTGGGAACTA CTCTTCGTGG 4 80 

AGTGGAGCTG C GAT T TAT AC TCCCTACCTT TTAGGTTCTA AGGCGAGTCG TCCTTCAGTA 54 0 

AATCTCAGCG GGAACCGCTA CCTGGTGTTT AGAGACT AT G TGAGCCAAGG TTATGGCGGC 600 

GCCGTATCTA CCCACAATCT CACACTCACG ACTCGAGGAC CTTCGTGTTT TGAAAATAAT 660 

CATGCTTATC ATGACGTGAA TAGTAATGGA GGAG C CAT T G CCATTGCTCC TGGAGGATCG 72 0 
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ATCTCTATAT C C GT GAAAAG CGGAGATCTC ATCTTCAAAG GAAAT AC AG C AT C AC AAGAC 7 80 

GGAAATACAA T AC AC AAC T C CATC CAT CTG CAATCTGGAG CACAGT T T AA GAACCTACGT 84 0 

GCTGTTTCAG AATCCGGAGT TTATTTCTAT GAT C CTATAA GCCATAGCGA GTCGCATAAA 900 

ATTACAGATC TTGTAATCAA TGCTCCTGAA G G AAAG G AAA CTTATGAAGG AACAAT TAG C 960 

TTCT CAGGAC TATGCCTGGA T GAT CAT GAA GTTTGTGCGG AAAATCTTAC TTCCACAATC 102 0 

CTACAAGATG T C AC AT TAG C AGGAGGAACT CTCTCTCTAT CGGATGGGGT TACCTTGCAA 108 0 

CTGCATTCTT TTAAGCAGGA AGCAAGCTCT ACGCTTACTA TGTCTCCAGG AAC C AC T CT G 114 0 

CTCTGCTCAG GAGATGCTCG GGTTCAGAAT CTGCACATCC TGATTGAAGA TACCGACAAC 12 0 0 

TTTGTTCCTG TAAGGATTCG CGCCGAGGAC AAGGATGCTC TTGTCTCATT AGAAAAACTT 12 60 

AAAGTTGCCT TTGAGGCTTA TTGGTCCGTC TATGACTTTC CTCAATTTAA GGAAGCCTTT 132 0 

ACGATTCCTC TTCTTGAACT TCTAGGGCCT TCTTTTGACA GTCTTCTCCT AGGGGAGACC 138 0 

ACTTTGGAGA GAACCCAAGT CACAACAGAG AATGACGCCG TTCGAGGTTT CTGGTCCCTA 14 4 0 

AGCT GGGAAG AGTACCCCCC TTCTCTGGAT AAAGACAGAA G GAT C AC AC C AACTAAGAAA 15 00 

ACTGTTTTCC TCACTTGGAA TCCTGAGATC ACTTCTACGC CATAA 1545 

(2) INFORMATION FOR SEQ ID NO : 2 0 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Thr lie Leu Arg Asn Phe Leu Thr Cys Ser Ala Leu Phe Leu Ala 
15 10 15 

Leu Pro Ala Ala Ala Gin Val Val Tyr Leu His Glu Ser Asp Gly Tyr 
20 25 30 

Asn Gly Ala lie Asn Asn Lys Ser Leu Glu Pro Lys lie Thr Cys Tyr 
35 40 45 

Pro Glu Gly Thr Ser Tyr lie Phe Leu Asp Asp Val Arg lie Ser Asn 
50 55 60 

Val Lys His Asp Gin Glu Asp Ala Gly Val Phe lie Asn Arg Ser Gly 
65 70 75 80 

Asn Leu Phe Phe Met Gly Asn Arg Cys Asn Phe Thr Phe His Asn Leu 
85 90 95 

Met Thr Glu Gly Phe Gly Ala Ala lie Ser Asn Arg Val Gly Asp Thr 
100 105 110 

Thr Leu Thr Leu Ser Asn Phe Ser Tyr Leu Thr Phe Thr Ser Ala Pro 
115 120 125 

Leu Leu Pro Gin Gly Gin Gly Ala lie Tyr Ser Leu Gly Ser Val Met 
130 135 140 

lie Glu Asn Ser Glu Glu Val Thr Phe Cys Gly Asn Tyr Ser Ser Trp 
145 150 155 160 



Ser Gly Ala Ala lie Tyr Thr Pro Tyr Leu Leu Gly Ser Lys Ala Ser 
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165 170 175 

Arg Pro Ser Val Asn Leu Ser Gly Asn Arg Tyr Leu Val Phe Arg Asp 
180 185 190 

Tyr Val Ser Gin Gly Tyr Gly Gly Ala Val Ser Thr His Asn Leu Thr 
195 200 205 

Leu Thr Thr Arg Gly Pro Ser Cys Phe Glu Asn Asn His Ala Tyr His 
210 215 220 

Asp Val Asn Ser Asn Gly Gly Ala lie Ala lie Ala Pro Gly Gly Ser 
225 230 235 240 

lie Ser lie Ser Val Lys Ser Gly Asp Leu lie Phe Lys Gly Asn Thr 
245 250 255 

Ala Ser Gin Asp Gly Asn Thr lie His Asn Ser lie His Leu Gin Ser 
260 265 270 

Gly Ala Gin Phe Lys Asn Leu Arg Ala Val Ser Glu Ser Gly Val Tyr 
275 280 285 

Phe Tyr Asp Pro lie Ser His Ser Glu Ser His Lys lie Thr Asp Leu 
290 295 300 

Val lie Asn Ala Pro Glu Gly Lys Glu Thr Tyr Glu Gly Thr lie Ser 
305 310 315 320 

Phe Ser Gly Leu Cys Leu Asp Asp His Glu Val Cys Ala Glu Asn Leu 
325 * 330 335 

Thr Ser Thr lie Leu Gin Asp Val Thr Leu Ala Gly Gly Thr Leu Ser 
340 345 350 

Leu Ser Asp Gly Val Thr Leu Gin Leu His Ser Phe Lys Gin Glu Ala 
355 . 360 365 

Ser Ser Thr Leu Thr Met Ser Pro Gly Thr Thr Leu Leu Cys Ser Gly 
370 375 380 

Asp Ala Arg Val Gin Asn Leu His lie Leu lie Glu Asp Thr Asp Asn 
385 390 395 400 

Phe Val Pro Val Arg lie Arg Ala Glu Asp Lys Asp Ala Leu Val Ser 
405 410 415 

Leu Glu Lys Leu Lys Val Ala Phe Glu Ala Tyr Trp Ser Val Tyr Asp 
420 425 430 

Phe Pro Gin Phe Lys Glu Ala Phe Thr lie Pro Leu Leu Glu Leu Leu 
435 440 445 

Gly Pro Ser Phe Asp Ser Leu Leu Leu Gly Glu Thr Thr Leu Glu Arg 
450 455 460 



Thr Gin Val Thr Thr Glu Asn Asp Ala Val Arg Gly Phe Trp Ser Leu 
465 470 475 480 
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Ser Trp Glu Glu Tyr Pro Pro Ser Leu Asp Lys Asp Arg Arg lie Thr 
485 490 495 

Pro Thr Lys Lys Thr Val Phe Leu Thr Trp Asn Pro Glu lie Thr Ser 
500 505 510 

Thr Pro 



(2) INFORMATION FOR SEQ ID NO : 2 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 1 : 

AT GAAAAC GT CTATTCGTAA GTTCTTAATT TCTACCACAC TGGCGCCATG TTTTGCTTCA 60 

ACAGCGTTTA CTGTAGAAGT TATCATGCCT T C C GAG AAC T TTGATGGATC GAGTGGGAAG 12 0 

ATTTTTCCTT ACACAACACT TTCTGATCCT AGAGGGACAC TCTGTATTTT TTCAGGGGAT 180 

CTCTACATTG CGAATCTTGA T AAT G C CAT A TCCAGAACCT CTTCCAGTTG CTTTAGCAAT 24 0 

AGGGCGGGAG CACTACAAAT C T TAG GAAAA GGTGGGGTTT TCTCCTTCTT AAATATCCGT 3 00 

TCTTCAGCTG ACGGAGCCGC GATTAGTAGT GT AAT C AC C C AAAATCCTGA ACTATGTCCC 3 60 

TTGAGTTTTT CAGGATTTAG TCAGATGATC T T C GAT AAC T GTGAATCTTT GACT T CAGAT 4 20 

ACCTCAGCGA GT AAT GT CAT ACCTCACGCA TCGGCGATTT ACGCTACAAC GCCCATGCTC 4 80 

TTTACAAACA AT GAC T C CAT AC TAT T C C AA T AC AAC C GT T CTGCAGGATT TGGAGCTGCC 54 0 

ATTCGAGGCA CAAGCATCAC AATAGAAAAT AC GAAAAAGA GCCTTCTCTT TAAT GGTAAT 600 

GGAT C CAT C T C TAAT G GAG G GGCCCTCACG GGATCTGCAG C GAT C AAC C T CAT CAACAAT 660 

AGCGCTCCTG TGATTTTCTC AACGAATGCT AC AG G GAT C T ATGGTGGGGC TATTTACCTT 72 0 

ACCGGAGGAT CTATGCTCAC CTCTGGGAAC CTCTCAGGAG TCTTGTTCGT T TAT AAT AG C 7 80 

TCGCGCT 7 87 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID. NO:22: 

Met Lys Thr Ser lie Arg Lys Phe Leu lie Ser Thr Thr Leu Ala Pro 
1 5 10 15 

Cys Phe Ala Ser Thr Ala Phe Thr Val Glu Val He Met Pro Ser Glu 
20 25 30 



Asn Phe Asp Gly Ser Ser Gly Lys He Phe Pro Tyr Thr Thr Leu Ser 
35 40 45 
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Asp Pro Arg Gly Thr Leu Cys lie Phe Ser Gly Asp Leu Tyr lie Ala 
50 55 60 

Asn Leu Asp Asn Ala lie Ser Arg Thr Ser Ser Ser Cys Phe Ser Asn 
65 70 75 80 

Arg Ala Gly Ala Leu Gin lie Leu Gly Lys Gly Gly Val Phe Ser Phe 
85 90 95 

Leu Asn lie Arg Ser Ser Ala Asp Gly Ala Ala lie Ser Ser Val lie 
100 105 110 

Thr Gin Asn Pro Glu Leu Cys Pro Leu Ser Phe Ser Gly Phe Ser Gin 
115 120 125 

Met lie Phe Asp Asn Cys Glu Ser Leu Thr Ser Asp Thr Ser Ala Ser 
130 135 140 

Asn Val lie Pro His Ala Ser Ala lie Tyr Ala Thr Thr Pro Met Leu 
145 150 155 160 

Phe Thr Asn Asn Asp Ser lie Leu Phe Gin Tyr Asn Arg Ser Ala Gly 
165 170 175 

Phe Gly Ala Ala lie Arg Gly Thr Ser lie Thr lie Glu Asn Thr Lys 
180 185 190 

Lys Ser Leu Leu Phe Asn Gly Asn Gly Ser lie Ser Asn Gly Gly Ala 
195 200 205 

Leu Thr Gly Ser Ala Ala lie Asn Leu lie Asn Asn Ser Ala Pro Val 
210 215 220 

lie Phe Ser Thr Asn Ala Thr Gly lie Tyr Gly Gly Ala lie Tyr Leu 
225 230 235 240 

Thr Gly Gly Ser Met Leu Thr Ser Gly Asn Leu Ser Gly Val Leu Phe 
245 250 255 

Val Tyr Asn Ser Ser Arg 
260 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2838 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

ATGAAGACTT CAGTTTCTAT GTTGTTGGCC CTGCTTTGCT CGGGGGCTAG CTCTATTGTA 60 

CTCCATGCCG CAACCACTCC ACTAAATCCT GAAGATGGGT TTATTGGGGA GGGCAATACA 120 

AATACTTTTT CTCCGAAATC TACAACGGAT GCTGCAGGAA CTACCTACTC T C T C AC AG G A 180 

GAGGTTCTGT TTATAGATCC GGGGAAAGGT GGTTCAATTA CAGGAACTTG CTTTGTAGAA 2 40 




ACTGCTGGCG ATCTTACATT TTTAGGTAAT GGAAATACCC TAAAGTTCCT GTCGGTAGAT 300 

GCAGGTGCTA ATATCGCGGT TGCTCATGTA CAAG GAAGT A AGAATTTAAG CTTCACAGAT 3 60 

TTCCTTTCTC TGGTGATCAC AGAATCTCCA AAATCCGCTG TTAGTACAGG AAAAGGTAGC 42 0 

CTAGTCAGTT CAGGTGCAGT CCAACTGCAA GATATAAACA CTCTAGTTCT T AC AAG C AAT 48 0 

GCCTCTGTCG AAGATGGTGG CGTGATTAAA GGAAACTCCT GCTTGATTCA G G G AAT C AAA 54 0 

AATAGTGCGA TTTTTGGACA AAATACATCT TCGAAAAAAG GAGGGGCGAT CTCCACGACT 600 

CAAGGACTCA C CAT AGAGAA TAACTTAGGG ACGCTAAAGT TCAATGAAAA CAAAGCAGTG 660 

ACCTCAGGAG GCGCCTTAGA TTTAGGAGCC GCGTCTACAT TCACTGCGAA C CAT GAGT T G 72 0 

ATATTTTCAC AAAATAAGAC TTCTGGGAAT GCTGCAAATG GC G GAG C CAT AAATTGCTCA 78 0 

GGCGACCTAA CAT T T AC T GA TAACACTTCT TTGTTACTTC AAGAAAATAG CAC AAT G C AG 84 0 

GATGGTGGAG CTTTGTGTAG CACAGGAACC ATAAGCATTA CCGGTAGTGA T T C TAT C AAT 9 00 

GTGATAGGAA ATACTTCAGG AC AAAAAG GA GGAGCGATTT CTGCAGCTTC TCTCAAGATT 9 60 

TTGGGAGGGC AGGGAGGCGC TCTCTTTTCT AATAAC GT AG TGACTCATGC CACCCCTCTA 102 0 

GGAGGTGCCA TTTTTATCAA CAC AG GAG GA TCCTTGCAGC TCTTCACTCA AGGAGGGGAT 108 0 

ATCGTATTCG AGGGGAATCA GGTCACTACA ACAGCTCCAA AT G C T AC CAC TAAGAGAAAT 114 0 

GTAATTCACC T C GAGAG CAC CGCGAAGTGG ACGGGACTTG CTGCAAGTCA AGGTAACGCT 12 00 

ATCTATTTCT AT GAT C C CAT TACCACCAAC GAT AC G G GAG CAAG C GAT AA CTTACGTATC 12 60 

AAT GAG GT CA GTGCAAAT CA AAAGCTCTCG GGAT CTATAG TATTTTCTGG AGAGAGATTG 132 0 

TCGACAGCAG AAG CTATAG C TGAAAATCTT ACTTCGAGGA TCAACCAGCC TGTCACTTTA 138 0 

GTAGAGGGGA GCTTAGAACT TAAACAGGGA GTGACCTTGA T CAC AC AAG G ATTCTCGCAG 14 4 0 

GAG C C AGAAT CCACGCTTCT TTTGGATTTG GGGACCTCAT T AC AAG C T T C TACAGAAGAT 15 0 0 

AT C GT CAT CA C AAAT T CAT C TATAAATGCC GAT AC CAT T T AC G GAAAGAA TCCAATCAAT 15 6 0 

ATTGTAGCTT CAGCAGCGAA T AAGAAC AT T AC C CTAACAG GAAC C T TAG C ACTT GTAAAT 162 0 

GCAGATGGAG CTTTGTATGA GAAC CAT AC C TTGCAAGACT CTCAAGATTA TAGCTTTGTA 168 0 

AAGTTATCTC CAGGAGCGGG AGGGACTATA AT TACT CAAG ATGCTTCTCA GAAGCTTCTT 174 0 

GAAGTAGCTC CTTCTAGACC ACATTATGGC TAT CAAG GAC ATTGGAATGT GCAAGT CAT C 18 00 

CCAGGAACGG GAACTCAACC GAGCCAGGCA AAT T T AGAAT GGGTGCGGAC AG GAT AC C T T 18 60 

CCGAATCCCG AACGGCAAGG ATTTTTAGTT CCCAATAGCC TGTGGGGTTC TTTTGTTGAT 192 0 

CAGCGTGCTA TCCAAGAAAT CAT G GTAAAT AG TAG C C AAA TCTTATGTCA GGAACGGGGA 198 0 

GTCTGGGGAG CTGGAATTGC TAATTTCCTA CATAGAGATA AAAT T AAT G A GCACGGCTAT 2 04 0 

CGCCATAGCG GTGTCGGTTA TCTTGTGGGA GTTGGCACTC ATGCTTTTTC TGATGCTACG 2100 

ATAAATGCGG CTTTTTGCCA GCTCTTCAGT AGAGATAAAG ACTACGTAGT AT C C AAAAAT 2160 

CATGGAACTA GCTACTCAGG GGTCGTATTT C T T GAG GAT A CCCTAGAGTT TAGAAGTCCA 222 0 

CAGGGATTCT ATACTGATAG CTCCTCAGAA GCTTGCTGTA AC CAAGT C GT CACTATAGAT 22 8 0 

ATGCAGTTGT C T T AC AG C C A TAGAAATAAT GAT AT GAAAA C C AAAT AC AC GAC AT AT C C A 2 34 0 

GAAGCTCAGG GATCTTGGGC AAAT GAT GT T TTTGGTCTTG AGTTTGGAGC GAC T AC AT AC 24 0 0 

TACTACCCTA ACAGTACTTT TTTATTTGAT TACTACTCTC CGTTTCTCAG GCTGCAGTGC 2 4 60 

ACCTATGCTC ACCAGGAAGA CTT CAAAGAG ACAGGAGGTG AGGTTCGTCA CTTTACTAGC 2 52 0 

GGAGATCTTT T C AAT T TAG C AGTTCCTATT GGCGT GAAGT TTGAGAGATT TTCAGACTGT 258 0 

AAAAGGGGAT C T TAT GAAC T TACCCTTGCT TATGTTCCTG ATGTGATTCG CAAAGATCCC 2 64 0 

AAGAGCACGG CAACATTGGC TAGTGGAGCT ACGTGGAGCA CCCACGGAAA CAATCTCTCC 27 0 0 

AGACAAGGAT TACAACTGCG T T TAG G GAAC CACTGTCTCA TAAATCCTGG AATTGAGGTG 27 60 

TTCAGTCACG GAG CT AT T GA ATTGCGGGGA TCCTCTCGTA ATTATAACAT CAATCTCGGG 2 82 0 

GGTAAATACC GATTTTAA 2 838 

(2 ) ; INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 



Met Lys Thr Ser Val Ser Met Leu Leu Ala Leu Leu Cys Ser Gly Ala 
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15 10 15 

Ser Ser lie Val Leu His Ala Ala Thr Thr Pro Leu Asn Pro Glu Asp 
20 25 30 

Gly Phe lie Gly Glu Gly Asn Thr Asn Thr Phe Ser Pro Lys Ser Thr 
35 40 45 

Thr Asp Ala Ala Gly Thr Thr Tyr Ser Leu Thr Gly Glu Val Leu Phe 
50 55 60 

lie Asp Pro Gly Lys Gly Gly Ser lie Thr Gly Thr Cys Phe Val Glu 
65 70 75 80 

Thr Ala Gly Asp Leu Thr Phe Leu Gly Asn Gly Asn Thr Leu Lys Phe 
85 90 95 

Leu Ser Val Asp Ala Gly Ala Asn lie Ala Val Ala His Val Gin Gly 
100 105 110 

Ser Lys Asn Leu Ser Phe Thr Asp Phe Leu Ser Leu Val lie Thr Glu 
115 120 125 

Ser Pro Lys Ser Ala Val Ser Thr Gly Lys Gly Ser Leu Val Ser Ser 
130 135 140 

Gly Ala Val Gin Leu Gin Asp lie Asn Thr Leu Val Leu Thr Ser Asn 
145 150 155 160 

Ala Ser Val Glu Asp Gly Gly Val lie Lys Gly Asn Ser Cys Leu lie 
165 170 175 

Gin Gly lie Lys Asn Ser Ala lie Phe Gly Gin Asn Thr Ser Ser Lys 
180 185 190 

Lys Gly Gly Ala lie Ser Thr Thr Gin Gly Leu Thr lie Glu Asn Asn 
195 200 205 

Leu Gly Thr Leu Lys Phe Asn Glu Asn Lys Ala Val Thr Ser Gly Gly 
210 215 220 

Ala Leu Asp Leu Gly Ala Ala Ser Thr Phe Thr Ala Asn His Glu Leu 
225 230 235 240 

lie Phe Ser Gin Asn Lys Thr Ser Gly Asn Ala Ala Asn Gly Gly Ala 
245 250 255 

lie Asn Cys Ser Gly Asp Leu Thr Phe Thr Asp Asn Thr Ser Leu Leu 
260 265 270 

Leu Gin Glu Asn Ser Thr Met Gin Asp Gly Gly Ala Leu Cys Ser Thr 
275 280 285 

Gly Thr lie Ser lie Thr Gly Ser Asp Ser lie Asn Val lie Gly Asn 
290 295 300 



Thr Ser Gly Gin Lys Gly Gly Ala lie Ser Ala Ala Ser Leu Lys lie 
305 310 315 320 



81 



Leu Gly Gly Gin Gly Gly Ala Leu Phe Ser Asn Asn Val Val Thr His 
325 330 335 

Ala Thr Pro Leu Gly Gly Ala lie Phe lie Asn Thr Gly Gly Ser Leu 
340 345 350 

Gin Leu Phe Thr Gin Gly Gly Asp lie Val Phe Glu Gly Asn Gin Val 
355 360 365 

Thr Thr Thr Ala Pro Asn Ala Thr Thr Lys Arg Asn Val lie His Leu 
370 375 380 

Glu Ser Thr Ala Lys Trp Thr Gly Leu Ala Ala Ser Gin Gly Asn Ala 
385 390 395 400 

lie Tyr Phe Tyr Asp Pro lie Thr Thr Asn Asp Thr Gly Ala Ser Asp 
405 410 415 

Asn Leu Arg lie Asn Glu Val Ser Ala Asn Gin Lys Leu Ser Gly Ser 
420 425 430 

lie Val Phe Ser Gly Glu Arg Leu Ser Thr Ala Glu Ala lie Ala Glu 
435 440 445 

Asn Leu Thr Ser Arg lie Asn Gin Pro Val Thr Leu Val Glu Gly Ser 
450 455 460 

Leu Glu Leu Lys Gin Gly Val Thr Leu lie Thr Gin Gly Phe Ser Gin 
465 470 475 480 

Glu Pro Glu Ser Thr Leu Leu Leu Asp Leu Gly Thr Ser Leu Gin Ala 
485 490 495 

Ser Thr Glu Asp lie Val lie Thr Asn Ser Ser lie Asn Ala Asp Thr 
500 505 510 

lie Tyr Gly Lys Asn Pro lie Asn lie Val Ala Ser Ala Ala Asn Lys 
515 520 525 

Asn lie Thr Leu Thr Gly Thr Leu Ala Leu Val Asn Ala Asp Gly Ala 
530 535 540 

Leu Tyr Glu Asn His Thr Leu Gin Asp Ser Gin Asp Tyr Ser Phe Val 
545 550 555 560 

Lys Leu Ser Pro Gly Ala Gly Gly Thr lie lie Thr Gin Asp Ala Ser 
565 570 575 

Gin Lys Leu Leu Glu Val Ala Pro Ser Arg Pro His Tyr Gly Tyr Gin 
580 585 590 

Gly His Trp Asn Val Gin Val lie Pro Gly Thr Gly Thr Gin Pro Ser 
595 600 605 



Gin Ala Asn Leu Glu Trp Val Arg Thr Gly Tyr Leu Pro Asn Pro Glu 
610 615 620 
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Arg Gin Gly Phe Leu Val Pro Asn Ser Leu Trp Gly Ser Phe Val Asp 
625 630 635 640 

Gin Arg Ala lie Gin Glu lie Met Val Asn Ser Ser Gin lie Leu Cys 
645 650 655 

Gin Glu Arg Gly Val Trp Gly Ala Gly lie Ala Asn Phe Leu His Arg 
660 665 670 

Asp Lys lie Asn Glu His Gly Tyr Arg His Ser Gly Val Gly Tyr Leu 
675 680 685 

Val Gly Val Gly Thr His Ala Phe Ser Asp Ala Thr lie Asn Ala Ala 
690 695 700 

Phe Cys Gin Leu Phe Ser Arg Asp Lys Asp Tyr Val Val Ser Lys Asn 
705 710 715 720 

His Gly Thr Ser Tyr Ser Gly Val Val Phe Leu Glu Asp Thr Leu Glu 
725 730 735 

Phe Arg Ser Pro Gin Gly Phe Tyr Thr Asp Ser Ser Ser Glu Ala Cys 
740 745 750 

Cys Asn Gin Val Val Thr lie Asp Met Gin Leu Ser Tyr Ser His Arg 
755 760 765 

Asn Asn Asp Met Lys Thr Lys Tyr Thr Thr Tyr Pro Glu Ala Gin Gly 
770 775 780 

Ser Trp Ala Asn Asp Val Phe Gly Leu Glu Phe Gly Ala Thr Thr Tyr 
785 790 795 800 

Tyr Tyr Pro Asn Ser Thr Phe Leu Phe Asp Tyr Tyr Ser Pro Phe Leu 
805 810 815 

Arg Leu Gin Cys Thr Tyr Ala His Gin Glu Asp Phe Lys Glu Thr Gly 
820 825 830 

Gly Glu Val Arg His Phe Thr Ser Gly Asp Leu Phe Asn Leu Ala Val 
835 840 845 

Pro lie Gly Val Lys Phe Glu Arg Phe Ser Asp Cys Lys Arg Gly Ser 
850 855 860 

Tyr Glu Leu Thr Leu Ala Tyr Val Pro Asp Val lie Arg Lys Asp Pro 
865 870 875 880 

Lys Ser Thr Ala Thr Leu Ala Ser Gly Ala Thr Trp Ser Thr His Gly 
885 890 895 

Asn Asn Leu Ser Arg Gin Gly Leu Gin Leu Arg Leu Gly Asn His Cys 
900 905 910 

Leu He Asn Pro Gly He Glu Val Phe Ser His Gly Ala He Glu Leu 
915 920 925 



Arg Gly Ser Ser Arg Asn Tyr Asn He Asn Leu Gly Gly Lys Tyr Arg 
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930 935 940 



Phe 
945 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3000 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/ KEY : Coding Sequence 

(B) LOCATION: 259... 3000 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

AT C AG GT GAT AAAAGTTCCT CGTTAGCTAG TGACTGTAGG T GACAT GAGA AAGCTAACAC 60 

G GAG GAAAC T AAAAC C C AAG GAATCGAAGT CTTCATGGTA ATGCTTTTGT TTTTTAGAGA 12 0 

ACTATTCGCA TCAATATAGA AACAAAATAA GTAAATCAAG TTAAAGATGA CAAAACAGCT 18 0 

GT CAAGAAT T TTTATCTTGA CTCTCTGAGT TTTCTATTTT ATATGACGCA AGTAAGAATT 24 0 

TAATAATAAA GTGGGTTT ATG AAA TCG CAA TTT TCC TGG TTA GTG CTC TCT 2 91 

Met Lys Ser Gin Phe Ser Trp Leu Val Leu Ser 
15 10 

TCG ACA TTG GCA TGT TTT ACT AGT TGT TCC ACT GTT TTT GCT GCA ACT 33 9 

Ser Thr Leu Ala Cys Phe Thr Ser Cys Ser Thr Val Phe Ala Ala Thr 
15 20 25 

GCT GAA AAT ATA GGC CCC TCT GAT AGC TTT GAC GGA AGT ACT AAC ACA 3 87 

Ala Glu Asn lie Gly Pro Ser Asp Ser Phe Asp Gly Ser Thr Asn Thr 
30 35 40 

GGC ACC TAT ACT CCT AAA AAT ACG ACT ACT GGA ATA GAC TAT ACT CTG 4 35 

Gly Thr Tyr Thr Pro Lys Asn Thr Thr Thr Gly lie Asp Tyr Thr Leu 
45 50 55 

ACA GGA GAT ATA ACT CTG CAA AAC CTT GGG GAT TCG GCA GCT TTA ACG 4 83 

Thr Gly Asp lie Thr Leu Gin Asn Leu Gly Asp Ser Ala Ala Leu Thr 
60 65 70 75 

AAG GGT TGT TTT TCT GAC ACT ACG GAA TCT TTA AGC TTT GCC GGT AAG 531 
Lys Gly Cys Phe Ser Asp Thr Thr Glu Ser Leu Ser Phe Ala Gly Lys 
80 85 90 

GGG TAC TCA CTT TCT TTT TTA AAT ATT AAG TCT AGT GCT GAA GGC GCA 57 9 

Gly Tyr Ser Leu Ser Phe Leu Asn lie Lys Ser Ser Ala Glu Gly Ala 
95 100 105 



GCA CTT TCT GTT ACA ACT GAT AAA AAT CTG TCG CTA ACA GGA TTT TCG 



627 



1 
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Ala Leu Ser 
110 

AGT CTT ACT 
Ser Leu Thr 
125 

GGA AAA GGT 
Gly Lys Gly 
140 

GGA ACT ATT 
Gly Thr lie 



ATT TCT ACC 
lie Ser Thr 



TTT GAA GGG 
Phe Glu Gly 
190 

TGT GCT ACT 
Cys Ala Thr 
205 

TTC TCG AAC 
Phe Ser Asn 
220 

GGA AAC TGT 
Gly Asn Cys 



AGT GTG AC A 
Ser Val Thr 



GAT GTT ACC 
Asp Val Thr 

270 

GCT GTA GCT 
Ala Val Ala 

285 

TCC GGG GGG 
Ser Gly Gly 
300 

ACC ACT GCA 
Thr Thr Ala 



TGT AGT CTT 
Cys Ser Leu 



Val Thr Thr 



TTC TTA GCG 
Phe Leu Ala 



GCA GTT AAA 
Ala Val Lys 
145 

TTA TTT AAA 
Leu Phe Lys 
160 

AAG AAT CTT 
Lys Asn Leu 
175 

AAT AAA TCG 
Asn Lys Ser 



GGT ACT GTA 
Gly Thr Val 

AAT ATT GCT 
Asn lie Ala 

225 

AC A ATT AC A 
Thr lie Thr 
240 

GCG ACC GCA 
Ala Thr Ala 
255 

ATA TCT GGG 
lie Ser Gly 



AAT GGC GGA 
Asn Gly Gly 



GGG GGG GGT 

Gly Gly Gly 
305 

GGT AAT GGT 

Gly Asn Gly 
320 

TCA GCA GAA 

Ser Ala Glu 
335 



Asp Lys Asn 
115 

GCC CCA TCA 

Ala Pro Ser 
130 

TGT GGA GGG 

Cys Gly Gly 



CAA GAT TAC 
Gin Asp Tyr 



TCT TTG AAA 
Ser Leu Lys 
180 

AGC GCA ACA 
Ser Ala Thr 
195 

GAT ATT ACA 
Asp lie Thr 
210 

GAA GCT GCA 
Glu Ala Ala 



GGG AAT ACG 
Gly Asn Thr 



GGA AAT GGA 
Gly Asn Gly 
260 

AAT CAG AGT 
Asn Gin Ser 
275 

GCC ATT TAT 
Ala lie Tyr 
290 

ATC TCC TTT 
lie Ser Phe 



GGA GCC ATT 
Gly Ala lie 



GCA GGG GAC 
Ala Gly Asp 
340 



Leu Ser Leu 



TCG GTA ATC 
Ser Val lie 
135 

GAT CTT ACA 
Asp Leu Thr 
150 

TGT GAG GAA 
Cys Glu Glu 
165 

AAC AGC ACG 
Asn Ser Thr 



GGG AAA AAA 
Gly Lys Lys 



AAT AAT ACG 
Asn Asn Thr 
215 

GGT GGA GCT 
Gly Gly Ala 
230 

TCT CTT GTA 
Ser Leu Val 
245 

GGA GCT CTT 
Gly Ala Leu 



GTA ACT TTC 
Val Thr Phe 



GCT AAG AAG 
Ala Lys Lys 
295 

TCT AAC AAT 
Ser Asn Asn 
310 

TCT ATA CTG 

Ser lie Leu 
325 

ATT ACC TTC 

lie Thr Phe 



Thr Gly Phe 

120 

ACA ACC CCC 

Thr Thr Pro 



TTT GAT AAC 
Phe Asp Asn 

AAT GGC GGA 
Asn Gly Gly 
170 

GGA TCG ATT 
Gly Ser lie 
185 

GGT GGG GCT 
Gly Gly Ala 
200 

GCT CCT ACC 
Ala Pro Thr 



ATA AAT AGC 
lie Asn Ser 



TTT TCT GAA 
Phe Ser Glu 
250 

TCT GGA GAT 
Ser Gly Asp 
265 

TCA GGA AAC 
Ser Gly Asn 
280 

CTT ACA CTG 
Leu Thr Leu 



ATA GTC CAA 
lie Val Gin 



GCA GCT GGA 
Ala Ala Gly 
330 

AAT GGG AAT 
Asn Gly Asn 
345 



Ser 



TCA 675 
Ser 



AAT 723 

Asn 

155 

GCC 771 
Ala 



TCT 819 
Ser 



ATT 8 67 

He 



CTC 915 
Leu 



ACA 9 63 

Thr 

235 

AAT 1011 
Asn 



GCC 1059 
Ala 



CAA 1107 
Gin 



GCT 1155 
Ala 



GGT 1203 

Gly 

315 

GAG 1251 
Glu 



GCC 1299 
Ala 
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ATT GTT GCA ACT ACA CCA CAA ACT ACA AAA AGA AAT TCT ATT GAC ATA 134 7 

lie Val Ala Thr Thr Pro Gin Thr Thr Lys Arg Asn Ser lie Asp He 
350 355 360 

GGA TCT ACT GCA AAG ATC ACG AAT TTA CGT GCA ATA TCT GGG CAT AGC 1395 
Gly Ser Thr Ala Lys He Thr Asn Leu Arg Ala He Ser Gly His Ser 
365 370 375 

ATC TTT TTC TAC GAT CCG ATT ACT GCT AAT ACG GCT GCG GAT TCT ACA 14 4 3 

He Phe Phe Tyr Asp Pro He Thr Ala Asn Thr Ala Ala Asp Ser Thr 
380 385 390 395 

GAT ACT TTA AAT CTC AAT AAG GCT GAT GCA GGT AAT AGT ACA GAT TAT 14 91 

Asp Thr Leu Asn Leu Asn Lys Ala Asp Ala Gly Asn Ser Thr Asp Tyr 
400 405 410 

AGT GGG TCG ATT GTT TTT TCT GGT GAA AAG CTC TCT GAA GAT GAA GCA 153 9 

Ser Gly Ser He Val Phe Ser Gly Glu Lys Leu Ser Glu Asp Glu Ala 
415 420 425 

AAA GTT GCA GAC AAC CTC ACT TCT ACG CTG AAG CAG CCT GTA ACT CTA 15 8 7 

Lys Val Ala Asp Asn Leu Thr Ser Thr Leu Lys Gin Pro Val Thr Leu 
430 435 440 

ACT GCA GGA AAT TTA GTA CTT AAA CGT GGT GTC ACT CTC GAT ACG AAA 1635 
Thr Ala Gly Asn Leu Val Leu Lys Arg Gly Val Thr Leu Asp Thr Lys 
445 450 455 

GGC TTT ACT CAG ACC GCG GGT TCC TCT GTT ATT ATG GAT GCG GGC ACA 1683 
Gly Phe Thr Gin Thr Ala Gly Ser Ser Val He Met Asp Ala Gly Thr 
460 465 470 475 

ACG TTA AAA GCA AGT ACA GAG GAG GTC ACT TTA ACA GGT CTT TCC ATT 17 31 

Thr Leu Lys Ala Ser Thr Glu Glu Val Thr Leu Thr Gly Leu Ser He 
480 485 490 

CCT GTA GAC TCT TTA GGC GAG GGT AAG AAA GTT GTA ATT GCT GCT TCT 177 9 

Pro Val Asp Ser Leu Gly Glu Gly Lys Lys Val Val He Ala Ala Ser 
495 500 505 

GCA GCA AGT AAA AAT GTA GCC CTT AGT GGT CCG ATT CTT CTT TTG GAT 182 7 

Ala Ala Ser Lys Asn Val Ala Leu Ser Gly Pro He Leu Leu "Leu Asp 
510 515 520 

AAC CAA GGG AAT GCT TAT GAA AAT CAC GAC TTA GGA AAA ACT CAA GAC 1875 
Asn Gin Gly Asn Ala Tyr Glu Asn His Asp Leu Gly Lys Thr Gin Asp 
525 530 . 535 

TTT TCA TTT GTG CAG CTC TCT GCT CTG GGT ACT GCA ACA ACT ACA GAT 1923 
Phe Ser Phe Val Gin Leu Ser Ala Leu Gly Thr Ala Thr Thr Thr Asp 
540 545 550 555 

GTT CCA GCG GTT CCT ACA GTA GCA ACT CCT ACG CAC TAT GGG TAT CAA 1971 
Val Pro Ala Val Pro Thr Val Ala Thr Pro Thr His Tyr Gly Tyr Gin 
560 565 570 

GGT ACT TGG GGA ATG ACT TGG GTT GAT GAT ACC GCA AGC ACT CCA AAG 2 019 
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Gly Thr Trp 



ACT AAG ACA 
Thr Lys Thr 
590 

CCT GAG CGT 
Pro Glu Arg 
605 

TCA GAC ATC 

Ser Asp lie 
620 

CTT TGT TCA 

Leu Cys Ser 



GAT AAA GAT 
Asp Lys Asp 



GGA TAT GCT 
Gly Tyr Ala 
670 

AGC TTT GCC 
Ser Phe Ala 
685 

GCT AAA AAT 
Ala Lys Asn 
700 

ATT ACA GAA 
lie Thr Glu 



GGC TCT TGG 
Gly Ser Trp 



AGC CAC GTC 
Ser His Val 
750 

GTG AAA GGT 
Val Lys Gly 
765 

TCT TCT CAT 
Ser Ser His 
780 

CCA TAC ATC 
Pro Tyr lie 



Gly Met Thr 
575 

GCG ACA TTA 
Ala Thr Leu 



CAA GGA CCT 
Gin Gly Pro 



CAA GCG ATT 
Gin Ala lie 
625 

GAT CGA GGC 
Asp Arg Gly 
640 

AAG AAA GGG 
Lys Lys Gly 
655 

ATC GGA GGT 
lie Gly Gly 



TTT TGC CAA 
Phe Cys Gin 



CAT ACT GAT 
His Thr Asp 
705 

TGT AGT GGG 
Cys Ser Gly 
720 

AGT CAT AAA 
Ser His Lys 
735 

AGT AAT GAT 
Ser Asn Asp 



TCT TGG GGG 
Ser Trp Gly 



TCT TAT CCT 
Ser Tyr Pro 
785 

AAA CTG AAT 
Lys Leu Asn 
800 



Trp Val Asp 
580 

GCT TGG ACC 
Ala Trp Thr 
595 

TTA GTT CCT 
Leu Val Pro 
610 

CAA GGT GTC 
Gin Gly Val 



TTC TGG GCT 
Phe Trp Ala 



GAA AAA CGC 
Glu Lys Arg 
660 

GCA GCG CAA 
Ala Ala Gin 
675 

CTC TTT GGT 
Leu Phe Gly 
690 

ACC TAT GCA 
Thr Tyr Ala 



TTC ATA GGT 
Phe lie Gly 



CCC CTC GTT 
Pro Leu Val 
740 

CTG AAG ACA 
Leu Lys Thr 
755 

AAT AAT GCT 
Asn Asn Ala 
770 

GAA TAC CTG 
Glu Tyr Leu 



CTG ACC TAT 
Leu Thr Tyr 



Asp Thr Ala 



AAT ACA GGC 
Asn Thr Gly 

AAT AGC CTT 
Asn Ser Leu 
615 

ATA GAG AGA 
lie Glu Arg 
630 

GCG GGA GTC 
Ala Gly Val 
645 

AAA TAC CGT 
Lys Tyr Arg 



ACT TGT TCT 
Thr Cys Ser 



AGC GAT AAA 

Ser Asp Lys 
695 

GGA GCC TTC 

Gly Ala Phe 
710 

TGT CTC TTA 

Cys Leu Leu 
725 

TTA GAA GGG 

Leu Glu Gly 



AAG TAT ACT 
Lys Tyr Thr 



TTT AAC ATG 

Phe Asn Met 
775 

CAT TGT TTT 

His Cys Phe 
790 

ATA CGT CAG 

lie Arg Gin 
805 



Ser Thr Pro 
585 

TAC CTT CCG 

Tyr Leu Pro 
600 

TGG GGA TCT 

Trp Gly Ser 

AGT GCT TTG 
Ser Ala Leu 



GCC AAT TTC 
Ala Asn Phe 
650 

CAT AAA TCT 
His Lys Ser 
665 

GAA AAC TTA 
Glu Asn Leu 
680 

GAT TTC TTA 
Asp Phe Leu 

TAT ATC CAA 
Tyr lie Gin 

GAT AAA CTT 
Asp Lys Leu 
730 

CAG CTC GCT 
Gin Leu Ala 
745 

GCG TAT CCT 
Ala Tyr Pro 
760 

ATG TTG GGA 
Met Leu Gly 



GAT ACC TAT 
Asp Thr Tyr 



GAC AGC TTC 
Asp Ser Phe 
810 



Lys 



AAT 2 0 67 

Asn 



TTT 2115 
Phe 



ACT 2163 

Thr 

635 

TTA 2211 
Leu 



GGT 2259 
Gly 



ATT 2307 
He 



GTC 2355 
Val 



CAC 2 4 03 

His 

715 

CCT 2451 
Pro 



TAT 2 4 9 9 

Tyr 



GAG 2547 
Glu 



GCT 2595 
Ala 



GCT 2643 

Ala 

795 

TCG 2691 
Ser 
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GAG AAA GGT 
Glu Lys Gly 



TTA TCT TTG 

Leu Ser Leu 
830 

GAC TTT TCT 

Asp Phe Ser 
845 

AAT GAT CCC 

Asn Asp Pro 
860 

GAA ACT TAT 

Glu Thr Tyr 



GGC AGT CAC 
Gly Ser His 



GTC TTT GAA 
Val Phe Glu 
910 



ACA GAA GGA 
Thr Glu Gly 
815 

CCT ATA GGG 
Pro lie Gly 



TAT GAT CTG 
Tyr Asp Leu 



AAA TGC ACT 

Lys Cys Thr 
865 

GCC AAT AAC 

Ala Asn Asn 
880 

TAC GCC TTC 

Tyr Ala Phe 
895 

GTT CGT GGA 

Val Arg Gly 



AGA TCT TTT 
Arg Ser Phe 
820 

GTG AAG TTT 
Val Lys Phe 
835 

ACT TTA TCC 
Thr Leu Ser 
850 

ACA GCA CTT 
Thr Ala Leu 



TTA GCA CGA 
Leu Ala Arg 



TCT CCT ATG 
Ser Pro Met 
900 

TCC 
Ser 



GAT GAC AGC 
Asp Asp Ser 



GAG AAG TTC 
Glu Lys Phe 



TAT GTT CCT 
Tyr Val Pro 
855 

GTA ATC AGC 
Val lie Ser 
870 

CAG GCC TTG 
Gin Ala Leu 
885 

TTT GAA GTG 
Phe Glu Val 



AAC CTC TTC 

Asn Leu Phe 
825 

TCT GAT TGT 

Ser Asp Cys 
840 

GAT CTT ATC 

Asp Leu lie 



GGA GCC TCT 
Gly Ala Ser 



CAA GTG CGT 
Gin Val Arg 
890 

CTC GGC CAG 
Leu Gly Gin 
905 



AAT 2739 
Asn 



AAT 2 7 8 7 

Asn 



CGC 2835 
Arg 



TGG 2883 

Trp 

875 

GCA 2 931 

Ala 



TTT 2979 
Phe 



3000 



(2) INFORMATION FOR SEQ ID NO : 2 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 914 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Lys Ser Gin Phe Ser Trp Leu Val Leu Ser Ser Thr Leu Ala Cys 
15 10 15 

Phe Thr Ser Cys Ser Thr Val Phe Ala Ala Thr Ala Glu Asn lie Gly 
20 25 30 

Pro Ser Asp Ser Phe Asp Gly Ser Thr Asn Thr Gly Thr Tyr Thr Pro 
35 40 45 

Lys Asn Thr Thr Thr Gly lie Asp Tyr Thr Leu Thr Gly Asp lie Thr 
50 55 60 

Leu Gin Asn Leu Gly Asp Ser Ala Ala Leu Thr Lys Gly Cys Phe Ser 
65 70 75 80 
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Asp Thr Thr Glu Ser Leu Ser Phe Ala Gly Lys Gly Tyr Ser Leu Ser 
85 90 95 

Phe Leu Asn lie Lys Ser Ser Ala Glu Gly Ala Ala Leu Ser Val Thr 
100 105 110 

Thr Asp Lys Asn Leu Ser Leu Thr Gly Phe Ser Ser Leu Thr Phe Leu 
115 120 125 

Ala Ala Pro Ser Ser Val lie Thr Thr Pro Ser Gly Lys Gly Ala Val 
130 135 140 

Lys Cys Gly Gly Asp Leu Thr Phe Asp Asn Asn Gly Thr lie Leu Phe 
145 150 155 160 

Lys Gin Asp Tyr Cys Glu Glu Asn Gly Gly Ala lie Ser Thr Lys Asn 
165 170 175 

Leu Ser Leu Lys Asn Ser Thr Gly Ser lie Ser Phe Glu Gly Asn Lys 
180 185 190 

Ser Ser Ala Thr Gly Lys Lys Gly Gly Ala lie Cys Ala Thr Gly Thr 
195 200 205 

Val Asp lie Thr Asn Asn Thr Ala Pro Thr Leu Phe Ser Asn Asn lie 
210 215 220 

Ala Glu Ala Ala Gly Gly Ala lie Asn Ser Thr Gly Asn Cys Thr lie 
225 230 235 240 

Thr Gly Asn Thr Ser Leu Val Phe Ser Glu Asn Ser Val Thr Ala Thr 
245 250 255 

Ala Gly Asn Gly Gly Ala Leu Ser Gly Asp Ala Asp Val Thr lie Ser 
260 265 270 

Gly Asn Gin Ser Val Thr Phe Ser Gly Asn Gin Ala Val Ala Asn Gly 
275 280 285 

Gly Ala lie Tyr Ala Lys Lys Leu Thr Leu Ala Ser Gly Gly Gly Gly 
290 295 300 

Gly lie Ser Phe Ser Asn Asn lie Val Gin Gly Thr Thr Ala Gly Asn 
305 310 315 320 

Gly Gly Ala lie Ser lie Leu Ala Ala Gly Glu Cys Ser Leu Ser Ala 
325 330 335 

Glu Ala Gly Asp lie Thr Phe Asn Gly Asn Ala lie Val Ala Thr Thr 
340 345 350 

Pro Gin Thr Thr Lys Arg Asn Ser lie Asp lie Gly Ser Thr Ala Lys 
355 360 365 



lie Thr Asn Leu Arg Ala lie Ser Gly His Ser lie Phe Phe Tyr Asp 
370 375 380 
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Pro lie Thr Ala Asn Thr Ala Ala Asp Ser Thr Asp Thr Leu Asn Leu 
385 390 395 400 

Asn Lys Ala Asp Ala Gly Asn Ser Thr Asp Tyr Ser Gly Ser lie Val 
405 410 415 

Phe Ser Gly Glu Lys Leu Ser Glu Asp Glu Ala Lys Val Ala Asp Asn 

420 425 430 

Leu Thr Ser Thr Leu Lys Gin Pro Val Thr Leu Thr Ala Gly Asn Leu 

435 440 445 

Val Leu Lys Arg Gly Val Thr Leu Asp Thr Lys Gly Phe Thr Gin Thr 
450 455 460 

Ala Gly Ser Ser Val lie Met Asp Ala Gly Thr Thr Leu Lys Ala Ser 
465 470 475 480 

Thr Glu Glu Val Thr Leu Thr Gly Leu Ser lie Pro Val Asp Ser Leu 
485 490 495 

Gly Glu Gly Lys Lys Val Val lie Ala Ala Ser Ala Ala Ser Lys Asn 
500 505 510 

Val Ala Leu Ser Gly Pro lie Leu Leu Leu Asp Asn Gin Gly Asn Ala 
515 520 525 

Tyr Glu Asn His Asp Leu Gly Lys Thr Gin Asp Phe Ser Phe Val Gin 
530 535 540 

Leu Ser Ala Leu Gly Thr Ala Thr Thr Thr Asp Val Pro Ala Val Pro 
545 550 555 560 

Thr Val Ala Thr Pro Thr His Tyr Gly Tyr Gin Gly Thr Trp Gly Met 
565 570 575 

Thr Trp Val Asp Asp Thr Ala Ser Thr Pro Lys Thr Lys Thr Ala Thr 
580 585 590 

Leu Ala Trp Thr Asn Thr Gly Tyr Leu Pro Asn Pro Glu Arg Gin Gly 
595 600 605 

Pro Leu Val Pro Asn Ser Leu Trp Gly Ser Phe Ser Asp lie Gin Ala 
610 615 620 

lie Gin Gly Val lie Glu Arg Ser Ala Leu Thr Leu Cys Ser Asp Arg 
625 630 635 640 

Gly Phe Trp Ala Ala Gly Val Ala Asn Phe Leu Asp Lys Asp Lys Lys 
645 . 650 655 

Gly Glu Lys Arg Lys Tyr Arg His Lys Ser Gly Gly Tyr Ala lie Gly 
660 665 670 

Gly Ala Ala Gin Thr Cys Ser Glu Asn Leu lie Ser Phe Ala Phe Cys 
675 680 685 

Gin Leu Phe Gly Ser Asp Lys Asp Phe Leu Val Ala Lys Asn His Thr 
690 695 700 
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Asp Thr Tyr Ala Gly Ala Phe Tyr lie Gin His lie Thr Glu Cys Ser 
705 710 715 720 

Gly Phe lie Gly Cys Leu Leu Asp Lys Leu Pro Gly Ser Trp Ser His 
725 730 735 

Lys Pro Leu .Val Leu Glu Gly Gin Leu Ala Tyr Ser His Val Ser Asn 
740 745 750 

Asp Leu Lys Thr Lys Tyr Thr Ala Tyr Pro Glu Val Lys Gly Ser Trp 
755 760 765 

Gly Asn Asn Ala Phe Asn Met Met Leu Gly Ala Ser Ser His Ser Tyr 
770 775 780 

Pro Glu Tyr Leu His Cys Phe Asp Thr Tyr Ala Pro Tyr lie Lys Leu 
785 790 795 800 

Asn Leu Thr Tyr lie Arg Gin Asp Ser Phe Ser Glu Lys Gly Thr Glu 
805 810 815 

Gly Arg Ser Phe Asp Asp Ser Asn Leu Phe Asn Leu Ser Leu Pro lie 
820 825 830 

Gly Val Lys Phe Glu Lys Phe Ser Asp Cys Asn Asp Phe Ser Tyr Asp 
835 840 845 

Leu Thr Leu Ser Tyr Val Pro Asp Leu lie Arg Asn Asp Pro Lys Cys 
850 855 860 

Thr Thr Ala Leu Val lie Ser Gly Ala Ser Trp Glu Thr Tyr Ala Asn 
865 870 875 880 

Asn Leu Ala Arg Gin Ala Leu Gin Val Arg Ala Gly Ser His Tyr Ala 
885 890 895 

Phe Ser Pro Met Phe Glu Val Leu Gly Gin Phe Val Phe Glu Val Arg 
900 905 910 

Gly Ser 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/ KEY : Coding Sequence 

(B) LOCATION: 1. . . 1200 
(D) OTHER INFORMATION: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GAT CCT AAA AAT AAA GAG TAC ACA GGG ACC ATA CTC TTT TCT GGA GAA 4 8 

Asp Pro Lys Asn Lys Glu Tyr Thr Gly Thr lie Leu Phe Ser Gly Glu 
15 10 15 

AAG AGT CTA GCA AAC GAT CCT AGG GAT TTT AAA TCT ACA ATC CCT CAG 9 6 

Lys Ser Leu Ala Asn Asp Pro Arg Asp Phe Lys Ser Thr lie Pro Gin 
20 25 30 

AAC GTC AAC CTG TCT GCA GGA TAC TTA GTT ATT AAA GAG GGG GCC GAA 14 4 

Asn Val Asn Leu Ser Ala Gly Tyr Leu Val lie Lys Glu Gly Ala Glu 
35 40 45 

GTC ACA GTT TCA AAA TTC ACG CAG TCT CCA GGA TCG CAT TTA GTT TTA 192 

Val Thr Val Ser Lys Phe Thr Gin Ser Pro Gly Ser His Leu Val Leu 
50 55 60 

GAT TTA GGA ACC AAA CTG ATA GCC TCT AAG GAA GAC ATT GCC ATC ACA 24 0 

Asp Leu Gly Thr Lys Leu lie Ala Ser Lys Glu Asp lie Ala lie Thr 
65 70 75 80 

GGC CTC GCG ATA GAT ATA GAT AGC TTA AGC TCA TCC TCA ACA GCA GCT 288 

Gly Leu Ala lie Asp lie Asp Ser Leu Ser Ser Ser Ser Thr Ala Ala 
85 90 95 

GTT ATT AAA GCA AAC ACC GCA AAT AAA CAG ATA TCC GTG ACG GAC TCT 33 6 

Val lie Lys Ala Asn Thr Ala Asn Lys Gin lie Ser Val Thr Asp Ser 
100 105 110 

ATA GAA CTT ATC TCG CCT ACT GGC AAT GCC TAT GAA GAT CTC AGA ATG 384 

lie Glu Leu lie Ser Pro Thr Gly Asn Ala Tyr Glu Asp Leu Arg Met 
115 120 125 

AGA AAT TCA CAG ACG TTC CCT CTG CTC TCT TTA GAG CCT GGA GCC GGG 4 32 

Arg Asn Ser Gin Thr Phe Pro Leu Leu Ser Leu Glu Pro Gly Ala Gly 
130 135 140 

GGT AGT GTG ACT GTA ACT GCT GGA GAT TTC CTA CCG GTA AGT CCC CAT 4 80 

Gly Ser Val Thr Val Thr Ala Gly Asp Phe Leu Pro Val Ser Pro His 

145 150 155 160 

TAT GGT TTT CAA GGC AAT TGG AAA TTA GCT TGG ACA GGA ACT GGA AAC 52 8 

Tyr Gly Phe Gin Gly Asn Trp Lys Leu Ala Trp Thr Gly Thr Gly Asn 
165 170 175 

AAA GTT GGA GAA TTC TTC TGG GAT AAA ATA AAT TAT AAG CCT AGA CCT 57 6 

Lys Val Gly Glu Phe Phe Trp Asp Lys lie Asn Tyr Lys Pro Arg Pro 
180 185 190 

GAA AAA GAA GGA AAT TTA GTT CCT AAT ATC TTG TGG GGG AAT GCT GTA 62 4 

Glu Lys Glu Gly Asn Leu Val Pro Asn lie Leu Trp Gly Asn Ala Val 
195 200 205 

AAT GTC AGA TCC TTA ATG CAG GTT CAA GAG ACC CAT GCA TCG AGC TTA 672 

Asn Val Arg Ser Leu Met Gin Val Gin Glu Thr His Ala Ser Ser Leu 
210 . 215 220 
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CAG ACA GAT 
Gin Thr Asp 
225 

GTA TCT GCC 
Val Ser Ala 



TAT GTT CTA 
Tyr Val Leu 



ATG GCA TTT 
Met Ala Phe 
275 

AAC AAC GAA 
Asn Asn Glu 
290 

ACC TCC CTA 
Thr Ser Leu 
305 

AAC GTC GGG 
Asn Val Gly 



TTT CAT TTT 
Phe His Phe 



GAC TAC GCA 
Asp Tyr Ala 
355 

TGG GCT ATA 
Trp Ala lie 
370 

GGA AAA CTT 
Gly Lys Leu 
385 



CGA GGG CTG 
Arg Gly Leu 
230 

TCC GAA GAC 
Ser Glu Asp 
245 

TCT GTA AAT 
Ser Val Asn 
260 

TCC CAA CTC 
Ser Gin Leu 



TAC AGA ATG 
Tyr Arg Met 



GGG AAT ATT 

Gly Asn lie 
310 

ATT CTC TCA 

lie Leu Ser 
325 

TTG TGT GCT 

Leu Cys Ala 
340 

AAT TTC CCT 

Asn Phe Pro 



AAA TGC GGA 
Lys Cys Gly 



TTC CAA GGT 
Phe Gin Gly 
390 



TGG ATC GAT 
Trp lie Asp 



AAT ATA AGG 
Asn lie Arg 



AAT GAG ATC 
Asn Glu lie 
265 

TTT AGT AGA 
Phe Ser Arg 
280 

TAT TTA GGA 
Tyr Leu Gly 
295 

TTC CGT TAT 
Phe Arg Tyr 



AGA AGG TTT 
Arg Arg Phe 



TAT GGT CAT 
Tyr Gly His 
345 

ATG GTG AAA 
Met Val Lys 
360 

GGG AGC ATG 
Gly Ser Met 
375 

GCC ATC CCA 
Ala lie Pro 



GGA ATT GGG 

Gly lie Gly 
235 

TAC CGT CAT 

Tyr Arg His 
250 

ACA CCT AAG 

Thr Pro Lys 



GAC AAA GAC 
Asp Lys Asp 



TCG TAT CTC 
Ser Tyr Leu 
300 

GCT TCG CGT 
Ala Ser Arg 
315 

CTT CAA AAT 
Leu Gin Asn 
330 

GCC ACC AAT 
Ala Thr Asn 



AAC AGC TGG 
Asn Ser Trp 



CCT CTA TTG 
Pro Leu Leu 
380 

TTT ATG AAA 
Phe Met Lys 
395 



AAT TTC TTC 
Asn Phe Phe 



AAC AGC GGT 
Asn Ser Gly 
255 

CAC TAT ACT 
His Tyr Thr 
270 

TAT GCG GTT 
Tyr Ala Val 
285 

TAT CAA TAT 
Tyr Gin Tyr 



AAC CCT AAT 
Asn Pro Asn 



CCT CTT ATG 
Pro Leu Met 
335 

GAT ATG AAA 
Asp Met Lys 
350 

AGA AAC AAT 
Arg Asn Asn 
365 

GTA TTT GAA 
Val Phe Glu 



CTA CAA TTA 
Leu Gin Leu 



CAT 720 

His 

240 

GGA 7 68 

Gly 



TCG 816 
Ser 



TCC 864 
Ser 



ACA 912 
Thr 



GTA 960 

Val 

320 

ATT 1008 

He 



ACA 1056 
Thr 



TGT 1104 
Cys 



AAC 1152 
Asn 



GTT 1200 

Val 

400 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 8 : 
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Asp Pro Lys Asn Lys Glu Tyr Thr Gly Thr lie Leu Phe Ser Gly Glu 
15 10 15 

Lys Ser Leu Ala Asn Asp Pro Arg Asp Phe Lys Ser Thr lie Pro Gin 
20 25 30 

Asn Val Asn Leu Ser Ala Gly Tyr Leu Val lie Lys Glu Gly Ala Glu 
35 40 45 

Val Thr Val Ser Lys Phe Thr Gin Ser Pro Gly Ser His Leu Val Leu 
50 55 60 

Asp Leu Gly Thr Lys Leu lie Ala Ser Lys Glu Asp lie Ala lie Thr 
65 70 75 80 

Gly Leu Ala lie Asp lie Asp Ser Leu Ser Ser Ser Ser Thr Ala Ala 
85 90 95 

Val lie Lys Ala Asn Thr Ala Asn Lys Gin lie Ser Val Thr Asp Ser 
100 105 110 

lie Glu Leu lie Ser Pro Thr Gly Asn Ala Tyr Glu Asp Leu Arg Met 
115 120 125 

Arg Asn Ser Gin Thr Phe Pro Leu Leu Ser Leu Glu Pro Gly Ala Gly 
130 135 140 

Gly Ser Val Thr Val Thr Ala Gly Asp Phe Leu Pro Val Ser Pro His 
145 150 155 160 

Tyr Gly Phe Gin Gly Asn Trp Lys Leu Ala Trp Thr Gly Thr Gly Asn 
165 170 175 

Lys Val Gly Glu Phe Phe Trp Asp Lys lie Asn Tyr Lys Pro Arg Pro 
180 185 190 

Glu Lys Glu Gly Asn Leu Val Pro Asn lie Leu Trp Gly Asn Ala Val 
195 200 205 

Asn Val Arg Ser Leu Met Gin Val Gin Glu Thr His Ala Ser Ser Leu 
210 215 220 

Gin Thr Asp Arg Gly Leu Trp lie Asp Gly lie Gly Asn Phe Phe His 
225 230 235 240 

Val Ser Ala Ser Glu Asp Asn lie Arg Tyr Arg His Asn Ser Gly Gly 
245 250 255 

Tyr Val Leu Ser Val Asn Asn Glu lie Thr Pro Lys His Tyr Thr Ser 
260 265 270 

Met Ala Phe Ser Gin Leu Phe Ser Arg Asp Lys Asp Tyr Ala Val Ser 
275 280 285 



Asn Asn Glu Tyr Arg Met Tyr Leu Gly Ser Tyr Leu Tyr Gin Tyr Thr 
290 295 300 
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Thr Ser Leu Gly 
305 

Asn Val Gly lie 



Phe His Phe Leu 
340 

Asp Tyr Ala Asn 
355 

Trp Ala lie Lys 
370 

Gly Lys Leu Phe 
385 



Asn lie Phe Arg 
310 

Leu Ser Arg Arg 
325 

Cys Ala Tyr Gly 



Phe Pro Met Val 
360 

Cys Gly Gly Ser 
375 

Gin Gly Ala lie 
390 



Tyr Ala Ser Arg 
315 

Phe Leu Gin Asn 
330 

His Ala Thr Asn 
345 

Lys Asn Ser Trp 



Met Pro Leu Leu 
380 

Pro Phe Met Lys 

395 



Asn Pro Asn Val 
320 

Pro Leu Met lie 
335 

Asp Met Lys Thr 
350 

Arg Asn Asn Cys 
365 

Val Phe Glu Asn 



Leu Gin Leu Val 
400 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/ KEY : Coding Sequence 

(B) LOCATION: 1...1830 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 9 : 

GAT CTC ACA TTA GGG AGT CGT GAC AGT TAT AAT GGT GAT ACA AGC ACC 4 8 

Asp Leu Thr Leu Gly Ser Arg Asp Ser Tyr Asn Gly Asp Thr Ser Thr 
15 10 15 

ACA GAA TTT ACT CCT AAA GCG GCA ACT TCT GAT GCT AGT GGC ACG ACC 9 6 

Thr Glu Phe Thr Pro Lys Ala Ala Thr Ser Asp Ala Ser Gly Thr Thr 
20 25 30 

TAT ATT CTC GAT GGG GAT GTC TCG ATA AGC CAA GCA GGG AAA CAA ACG 14 4 

Tyr lie Leu Asp Gly Asp Val Ser lie Ser Gin Ala Gly Lys Gin Thr 
35 . 40 45 

AGC TTA ACC ACA AGT TGT TTT TCT AAC ACT GCA GGA AAT CTT ACC TTC 192 
Ser Leu Thr Thr Ser Cys Phe Ser Asn Thr Ala Gly Asn Leu Thr Phe 
50 55 60 

TTA GGG AAC GGA TTT TCT CTT CAT TTT GAC AAT ATT ATT TCG TCT ACT 24 0 

Leu Gly Asn Gly Phe Ser Leu His Phe Asp Asn lie lie Ser Ser Thr 
65 70 75 80 



GTT GCA GGT GTT GTT GTT AGC AAT ACA GCA GCT 
Val Ala Gly Val Val Val Ser Asn Thr Ala Ala 



TCT GGG ATT ACG AAA 
Ser Gly lie Thr Lys 
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TTC TCA GGA 
Phe Ser Gly 



GGT AAA GGA 
Gly Lys Gly 
115 

GGG AAT CTT 
Gly Asn Leu 
130 

GCT CTC AAT 
Ala Leu Asn 
145 

GGA ACC ATA 
Gly Thr lie 



GAT GAG AAG 
Asp Glu Lys 



AAT GGG ACT 
Asn Gly Thr 
195 

TTC TCT CAG 
Phe Ser Gin 
210 

TTG GTT GCA 
Leu Val Ala 
225 

ATA GAC TCT 
lie Asp Ser 



GCT CAG AAA 
Ala Gin Lys 



GAT GAG AGT 
Asp Glu Ser 
275 

GAT GGG ATT 
Asp Gly lie 
290 

GAT TCT CGC 
Asp Ser Arg 
305 



85 

TTT TCA ACT 
Phe Ser Thr 
100 

GCC ATT AAA 
Ala lie Lys 



GAT CCG ATT 
Asp Pro lie 



ATT AAT AGC 

lie Asn Ser 
150 

GTC TTT TCT 

Val Phe Ser 
165 

AAC CGC ACT 

Asn Arg Thr 
180 

GTA GTT TTA 

Val Val Leu 



GAT GCA AAC 
Asp Ala Asn 



AAC ACC GAA 
Asn Thr Glu 
230 

CTC AGG AAC 
Leu Arg Asn 
245 

GAT ATT CGT 
Asp lie Arg 
260 

TTT TAT CAA 
Phe Tyr Gin 

CTT GAG TTA 
Leu Glu Leu 



AGT ATA GAT 
Ser lie Asp 
310 



CTT CGG ATG 
Leu Arg Met 
105 

ATT ACC GAT 
lie Thr Asp 
120 

ACT GTA ACA 
Thr Val Thr 

135 

CCT GAT ACT 
Pro Asp Thr 



GGA GAG AAG 
Gly Glu Lys 

TCT AAA TTA 
Ser Lys Leu 
185 

AAA GGT GAT 
Lys Gly Asp 
200 

TCT AAG TTG 

Ser Lys Leu 
215 

AGT ATC GAG 

Ser lie Glu 



GGG AAA AAG 
Gly Lys Lys 



ATA GAT CGT 
lie Asp Arg 
265 

AAT GGC TTT 
Asn Gly Phe 
280 

GAT GCT GGG 
Asp Ala Gly 
295 

GCT GTA CAA 
Ala Val Gin 
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90 

CTT GCA GCT 
Leu Ala Ala 



GGT CTG GTG 
Gly Leu Val 



GGA TCG ACA 
Gly Ser Thr 
140 

GGA GAT AAC 
Gly Asp Asn 
155 

CTC ACG GAG 
Leu Thr Glu 
170 

CTT CAA AAT 
Leu Gin Asn 



GTC GTT TTA 
Val Val Leu 



ATT ATG GAT 
lie Met Asp 
220 

TTA ACG AAT 
Leu Thr Asn 
235 

ATA AAA CTC 
lie Lys Leu 
250 

CCT GTT GTA 
Pro Val Val 



TTG AAT GAG 
Leu Asn Glu 



AAA GAC ATC 
Lys Asp lie 
300 

TCT CCG TAT 
Ser Pro Tyr 
315 



95 

CCT AGG ACC 
Pro Arg Thr 
110 

TTT GAG AGT 
Phe Glu Ser 
125 

TCT GTT GCT 
Ser Val Ala 



AAA GAG TAT 
Lys Glu Tyr 



GCA GAA GCT 
Ala Glu Ala 
175 

GTT GCT TTT 
Val Ala Phe 
190 

AGT GCG AAC 
Ser Ala Asn 
205 

TTA GGG ACG 
Leu Gly Thr 



TTG GAA ATT 
Leu Glu lie 



AGT GCT GCC 
Ser Ala Ala 
255 

CTG GCA ATT 
Leu Ala lie 
270 

GAC CAT TCC 
Asp His Ser 
285 

GTG ATT TCT 
Val lie Ser 



GGC TAT CAG 
Gly Tyr Gin 



ACA 336 
Thr 



ATA 384 
He 



GAT 4 32 

Asp 



ACG 4 80 

Thr 

160 

AAA 528 
Lys 

AAA 576 
Lys 

GGT 62 4 

Gly 



TCG 672 
Ser 



AAT 72 0 

Asn 

240 

ACA 7 68 

Thr 



AGC 816 
Ser 



TAT 8 64 

Tyr 



GCA 912 
Ala 



GGA 9 60 

Gly 

320 
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AAG TGG ACG 
Lys Trp Thr 



TGG GCG AAG 
Trp Ala Lys 



GTT CCT AAT 

Val Pro Asn 
355 

AAT TTT ATA 

Asn Phe lie 
370 

TGG GTT GCA 

Trp Val Ala 
385 

CAA AGG AAA 

Gin Arg Lys 



ACG AGG ATG 
Thr Arg Met 



TTT GCG CGT 
Phe Ala Arg 
435 

TAC GCA GGA 
Tyr Ala Gly 
450 

GTG AGT ATC 
Val Ser lie 
465 

TAT GTT TCC 
Tyr Val Ser 



GGC CAT ACG 
Gly His Thr 



CCG ACG CTC 
Pro Thr Leu 
515 

GGA GAG CTG 
Gly Glu Leu 
530 

TTT TTC CGA 
Phe Phe Arg 



ATC AAT TGG 

lie Asn Trp 

325 

CAG AGT TTT 

Gin Ser Phe 
340 

CTT CTT TGG 

Leu Leu Trp 



GAG CTA GGT 
Glu Leu Gly 



GGC ATT TCC 
Gly lie Ser 
390 

TTC CGT CAT 
Phe Arg His 
405 

CCG GGT GGT 
Pro Gly Gly 
420 

GAC AAA GAC 
Asp Lys Asp 

TCT TTA CGT 
Ser Leu Arg 

CTT TTA GGA 
Leu Leu Gly 
470 

AAT ACT CTG 
Asn Thr Leu 
485 

GAT CAT CGC 
Asp His Arg 
500 

TCG ACG GAT 
Ser Thr Asp 



GGA ACT CGA 
Gly Thr Arg 



GAG TAC ACT 
Glu Tyr Thr 



TCT ACT GAT 
Ser Thr Asp 



AAT CCC ACT 
Asn Pro Thr 
345 

GGT TCT TTT 
Gly Ser Phe 
360 

ACT GAA GGT 
Thr Glu Gly 
375 

AAT GTT TTG 
Asn Val Leu 



GTG AGT GGA 
Val Ser Gly 



GAT ACC TTG 
Asp Thr Leu 
425 

TAC TTT ATG 
Tyr Phe Met 
440 

TTG CAG CAC 
Leu Gin His 
455 

GAG GGA GGA 
Glu Gly Gly 



CCG TGC TCT 
Pro Cys Ser 



ATG AAG ACC 
Met Lys Thr 
505 

CAT ACT TCT 
His Thr Ser 
520 

GTT GCT GTT 
Val Ala Val 

535 

CCA TTT GTA 
Pro Phe Val 



GAT AAG AAA 

Asp Lys Lys 
330 

GCT GAG CAG 

Ala Glu Gin 



ATA GAT GTT 
lie Asp Val 



GCT CCT TAC 

Ala Pro Tyr 
380 

CAT AGG AGC 

His Arg Ser 
395 

GGT GCT GTA 

Gly Ala Val 
410 

TCT CTG GGT 

Ser Leu Gly 



AAT ACC AAT 
Asn Thr Asn 



GAT GCT TCC 

Asp Ala Ser 
460 

CTC CGC GAG 

Leu Arg Glu 
475 

TTC TAT GGG 

Phe Tyr Gly 
490 

GAG TCT CTA 

Glu Ser Leu 



TGG GGA GGA 
Trp Gly Gly 



GAA AAT ACC 
Glu Asn Thr 
540 

AAA GTC CAA 
Lys Val Gin 



GCT ACG GTT 
Ala" Thr Val 
335 

GAG GCT CCG 
Glu Ala Pro 
35 0 

CGT TCC TTC 
Arg Ser Phe 
365 

GAA AAG AGA 
Glu Lys Arg 

GGT CGT GAA 
Gly Arg Glu 



GTA GGT GCT 
Val Gly Ala 
415 

TTT GCT CAG 
Phe Ala Gin 
430 

TTC GCA AAG 
Phe Ala Lys 
445 

CTA TAC TCT 
Leu Tyr Ser 



ATC CTG TTG 
lie Leu Leu 



CAG CTT AGC 
Gin Leu Ser 
495 

CCC CCC CCC 
Pro Pro Pro 
510 

TAT GTC TGG 
Tyr Val Trp 
525 

AGC GGC AGA 
Ser Gly Arg 



GCT GTT TAC 
Ala Val Tyr 



TCT 1008 
Ser 



TTA 105 6 
Leu 



CAG 1104 
Gin 



TTT 1152 
Phe 



AAT 1200 

Asn 

400 

AGC 12 4 8 

Ser 



CTC 1296 
Leu 



ACC 1344 
Thr 



GTG 1392 
Val 



CCT 1440 

Pro 

480 

TAC 14 88 

Tyr 



CCC 1536 
Pro 



GCT 1584 
Ala 



GGA 1632 
Gly 



TCG 1680 
Ser 
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545 550 555 560 

CGC CAA GAT AGC TTT GTT GAA CTA GGA GCT ATC AGT CGT GAT TTT AGT 172 8 

Arg Gin Asp Ser Phe Val Glu Leu Gly Ala lie Ser Arg Asp Phe Ser 
565 570 575 

GAT TCG CAT CTT TAT AAC CTT GCG ATT CCT CTT GGA ATC AAG TTA GAG 177 6 

Asp Ser His Leu Tyr Asn Leu Ala lie Pro Leu Gly lie Lys Leu Glu 
580 585 590 

AAA CGG TTT GCA GAG CAA TAT TAT CAT GTT GTT GCG ATG TAT TCT CCA 182 4 

Lys Arg Phe Ala Glu Gin Tyr Tyr His Val Val Ala Met Tyr Ser Pro 
595 600 605 

GAT GTT 1830 
Asp Val 
610 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 0 : 

Asp Leu Thr Leu Gly Ser Arg Asp Ser Tyr Asn Gly Asp Thr Ser Thr 
15 10 15 

Thr Glu Phe Thr Pro Lys Ala Ala Thr Ser Asp Ala Ser Gly Thr Thr 
20 25 30 

Tyr lie Leu Asp Gly Asp Val Ser lie Ser Gin Ala Gly Lys Gin Thr 
35 40 45 

Ser Leu Thr Thr Ser Cys Phe Ser Asn Thr Ala Gly Asn Leu Thr Phe 
50 55 60 

Leu Gly Asn Gly Phe Ser Leu His Phe Asp Asn lie lie Ser Ser Thr 
65 70 75 80 

Val Ala Gly Val Val Val Ser Asn Thr Ala Ala Ser Gly lie Thr Lys 
85 90 95 

Phe Ser Gly Phe Ser Thr Leu Arg Met Leu Ala Ala Pro Arg Thr Thr 
100 105 110 

Gly Lys Gly Ala lie Lys lie Thr Asp Gly Leu Val Phe Glu Ser lie 
115 120 125 



Gly Asn Leu Asp Pro lie Thr Val Thr Gly Ser Thr Ser Val Ala Asp 
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130 135 140 

Ala Leu Asn lie Asn Ser Pro Asp Thr Gly Asp Asn Lys Glu Tyr Thr 
145 150 155 160 

Gly Thr lie Val Phe Ser Gly Glu Lys Leu Thr Glu Ala Glu Ala Lys 
165 170 - 175 

Asp Glu Lys Asn Arg Thr Ser Lys Leu Leu Gin Asn Val Ala Phe Lys 
180 185 190 

Asn Gly Thr Val Val Leu Lys Gly Asp Val Val Leu Ser Ala Asn Gly 
195 200 205 

Phe Ser Gin Asp Ala Asn Ser Lys Leu lie Met Asp Leu Gly Thr Ser 
210 215 220 

Leu Val Ala Asn Thr Glu Ser lie Glu Leu Thr Asn Leu Glu lie Asn 
225 230 235 240 

lie Asp Ser Leu Arg Asn Gly Lys Lys lie Lys Leu Ser Ala Ala Thr 
245 250 255 

Ala Gin Lys Asp lie Arg lie Asp Arg Pro Val Val Leu Ala lie Ser 
260 265 270 

Asp Glu Ser Phe Tyr Gin Asn Gly Phe Leu Asn Glu Asp His Ser Tyr 
275 280 285 

Asp Gly lie Leu Glu Leu Asp Ala Gly Lys Asp lie Val lie Ser Ala 
290 295 300 

Asp Ser Arg Ser lie Asp Ala Val Gin Ser Pro Tyr Gly Tyr Gin Gly 
305 310 315 320 

Lys Trp Thr lie Asn Trp Ser Thr Asp Asp Lys Lys Ala Thr Val Ser 
325 330 335 

Trp Ala Lys Gin Ser Phe Asn Pro Thr Ala Glu Gin Glu Ala Pro Leu 
340 345 350 

Val Pro Asn Leu Leu Trp Gly Ser Phe lie Asp Val Arg Ser Phe Gin 
355 360 365 

Asn Phe lie Glu Leu Gly Thr Glu Gly Ala Pro Tyr Glu Lys Arg Phe 
370 375 380 

Trp Val Ala Gly lie Ser Asn Val Leu His Arg Ser Gly Arg Glu Asn 
385 390 395 400 

Gin Arg Lys Phe Arg His Val Ser Gly Gly Ala Val Val Gly Ala Ser 
405 410 415 

Thr Arg Met Pro Gly Gly Asp Thr Leu Ser Leu Gly Phe Ala Gin Leu 
420 425 430 

Phe Ala Arg Asp Lys Asp Tyr Phe Met Asn Thr Asn Phe Ala Lys Thr 
435 440 445 



Tyr Ala Gly Ser 
450 

Val Ser lie Leu 
465 

Tyr Val Ser Asn 



Gly His Thr Asp 
500 

Pro Thr Leu Ser 
515 

Gly Glu Leu Gly 
530 

Phe Phe Arg Glu 
545 

Arg Gin Asp Ser 



Asp Ser His Leu 
580 

Lys Arg Phe Ala 
595 

Asp Val 
610 



Leu Arg Leu Gin 
455 

Leu Gly Glu Gly 
470 

Thr Leu Pro Cys 
485 

His Arg Met Lys 



Thr Asp His Thr 
520 

Thr Arg Val Ala 
535 

Tyr Thr Pro Phe 
550 

Phe Val Glu Leu 
565 

Tyr Asn Leu Ala 



Glu Gin Tyr Tyr 
600 



His Asp Ala Ser 
460 

Gly Leu Arg Glu 
475 

Ser Phe Tyr Gly 
490 

Thr Glu Ser Leu 
505 

Ser Trp Gly Gly 



Val Glu Asn Thr 
540 

Val Lys Val Gin 
555 

Gly Ala lie Ser 
570 

lie Pro Leu Gly 
585 

His Val Val Ala 



Leu Tyr Ser Val 



lie Leu Leu Pro 
480 

Gin Leu Ser Tyr 
495 

Pro Pro Pro Pro 
510 

Tyr Val Trp Ala 
525 

Ser Gly Arg Gly 



Ala Val Tyr Ser 
560 

Arg Asp Phe Ser 
575 

lie Lys Leu Glu 
590 

Met Tyr Ser Pro 
605 



